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A NEW ASSAY FOR THE DETECTION AND QUANTITATION OF 

HEMIMETHYLATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This patent application claims the benefit of priority to United States Provisional 
Patent Application Serial Number 60/247,191, entitled "A NEW ASSAY FOR THE 

DETECTION AND QUANTITATION OF HEMIMETHYLATION," filed on 08 November 
2000. 

TECHNICAL FIELD OF THE INVENTION 
The present invention relates to DNA methylation, and in particular hemi- 
methylation. The present invention provides a method to measure the fraction of DNA 
molecules that is 

hemi-methylated at a specific CpG dinucleotide in a particular DNA sequence, in a pool of 
DNA molecules having mixed DNA methylation states at said CpG dinucleotide. The 
15 present invention also provides for methods to calculate the rates of de novo-, and 
maintenance-methylation per cell division that occur at said CpG dinucleotide. 

BACKGROUND OF THE INVENTION 
The identification of early genetic changes in tumorigenesis is a primary focus in 
molecular cancer research. Characterization of the nature and pattern of cancer-associated 
genetic alterations will allow for early detection, diagnosis and treatment of cancer. Such 
genetic alterations in vertebrates fall generally into one of three categories: gain or loss of 
genetic material; mutation of genetic material; or methylation at cytosine residues in CpG 
dinucleotides within "CpG islands." Among these, DNA methylation is uniquely reversible 
and changes in methylation state are known to affect gene expression {e.g., transcriptional ' 
initiation of genes where CpG islands located at or near the promoter region) or genomic 
stability. 

In higher order eukaryotic organisms, DNA is methylated only at cytosines located 5' 
to guanosine in the CpG dinucleotide. This modification has important regulatory effects on 
gene expression predominantly when it involves CpG rich areas (CpG islands) located in the 
promoter region of a gene sequence. Extensive methylation of CpG islands has been 
associated with transcriptional inactivation of selected imprinted genes and genes on the 
inactive X chromosome of females. Aberrant methylation of normally unmethylated CpG 
islands has been described as a frequent event in immortalized and transformed cells and has 
been frequently associated with transcriptional inactivation of tumor suppressor genes in 
human cancers. 

DNA methylases transfer methyl groups from a universal methyl donor, such as S- 
adenosyl methionine, to specific sites on the DNA. Mammalian cells possess methylases that 
methylate cytosine residues on DNA that are 5' neighbors of guanine in CpG dinucleotides 
40 (CpG) . This methylation may play a role in gene inactivation, cell differentiation, 
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tumorigenesis, X-chromosome inactivation, and genomic imprinting. CpG islands remain 
unmethylated in normal cells, except during X-chromosome inactivation and parental specific 
imprinting where methylation of 5' regulatory regions can lead to transcriptional repression. 
DNA methylation is also a mechanism for modifying the base sequence of DNA without 
5 altering its coding function. DNA methylation is a heritable, reversible and epigenetic 
change. Yet, DNA methylation has the potential to alter gene expression, which has 
profound developmental and genetic consequences. 

DNA methylation is required for mammalian development (Li et al., 1992; Okano et 
al., 1999) yet the mechanisms responsible for the establishment and copying of methylation 
10 patterns remain almost completely unknown. 5-Methylcytosine is asymmetrically distributed 
in the genome and is most commonly found in CpG-poor regions, since most CpG islands in 
somatic cells remain methylation-free, except for the promoters of imprinted genes and genes 
on the inactive X-chromosome (Bird et aL, 1985). 

Methylation occurs after cytosine has been incorporated into DNA in a process 
1 5 catalyzed by DNA methyltransferases ("Dnmts") which transfer the methyl group from S- 
adenosylmethionine to the 5' -position of the pyrimidine ring in, characteristically but not 
exclusively, the context of the palindromic CpG dinucleotide (Ramsahoye et al., 2000). 
Three Dnmt enzymes are known in mouse and human, and these have overlapping yet 
distinct abilities to methylate ec hemimethylated" and completely unmethylated CpG 
20 dinucleotide pairs. Hemi-methylation is defined as a state in which the two opposing 

cytosines on either DNA strand in a single palindromic CpG dinucleotide differ in that one is 
methylated at the C-5 position, and the other is not. 

The predominant Dnmt in the cell, Dnmtl, was cloned and characterized by Bestor 
and colleagues (1988) and is localized to replication machines in the S-phase nucleus 
25 (Leonhardt et al., 1992; Rountree et al., 2000). Since Dnmtl shows a preference for 
hemimethylated CpG pairs (Gruenbaum et al., 1981; Bestor and Ingram, 1983), it is 
considered to be an excellent candidate for copying the pattern of methylation present on the 
parental strand after DNA has been replicated. However, Dnmtl is capable of modifying 
unmethylated DNA in the test tube, and is thus also a candidate for inducing de novo 
30 methylation. The recently discovered Dnmts, Dnmt3a and 3b (Okano et al, 1998) show 

equal activities in vitro for unmethylated and hemimethylated substrates (Okano et al., 1998), 
and have been shown to be capable of de novo methylation of transfected DNA in culture 
(Hsieh, 1999) and in Drosophila (Lyko et aL, 1999). Interestingly, satellite DNAs appear to 
be a preferred target for the human DNMT3B enzyme, because these satellite DNA 
35 sequences are specifically undermethylated in patients with ICF syndrome, characterized by 
germ-line mutations in the DNMT3B gene (Hansen et aL, 1999; Okano et aL, 1999; Xu et al., 
1999). 

Unfortunately, despite the identification of the above-mentioned Dnmts, and while 
DNA methylation has been implicated in gene regulation and cancer, especially in the context 
40 CpG islands, little is known about the significance, in these processes, of hemimethylation at 
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specific CpG sites, or of the general extent of hemimethylation in CpG-poor DNA regions. 
This knowledge will require methods to measure characteristics (e.g., extent, rate) of 
hemimethylation at specific CpG dinucleo tides. 

Therefore, there is a need in the art to determine the role of specific Dnmts in copying 
5 the methylation patterns of a CpG-poor region of DNA. Additionally, there is a need in the 
art for a method to measure the fraction of DNA molecules that are hemi-methylated at a 
specific CpG dinucleotide in a particular DNA sequence, in a pool of DNA molecules having 
mixed DNA methylation states at said CpG dinucleotide. Furthermore, there is a need in the 
art methods to calculate the rates of de novo-, and maintenance-methylation per cell division 
1 0 that occur at said CpG dinucleotide. 



SUMMARY OF THE INVENTION 
The present invention provides a method to measure the fraction of DNA molecules 
that is hemi-methylated at a specific CpG dinucleotide in a pool of DNA with mixed DNA 
15 methylation states at that CpG dinucleotide, comprising: extracting DNA from the various 
cell types; digesting the DNA sample with an excess of a methylation-sensitive restriction 
endonuclease that cleaves according to a recognition sequence motif comprising a 
palindromic CpG methylation site, whereby the endonuclease cleaves the DNA at a specific 
recognition motif position if the methylation state of the respective palindromic CpG 
20 methylation site is un-methylated, but not hemi-methylated or fully-methylated (e.g., HpaH); 
treating the DNA with bisulfite; amplifying the treated DNA by PCR with primers flanking 
the specific recognition motif position (e.g., the HpaH site); and determining the degree of 
hemi-methylation by averaging results obtained by sequence analysis of individual cloned 
PCR products for the presence of C or T at the first position of the specific CpG dinucleotide. 
25 Preferably, analysis of individual cloned PCR products for the presence of C or T at the first 
position of the specific CpG dinucleotide was accomplished using a methylation-sensitive 
single nucleotide primer extension (Ms-SNuPE) assay (Gonzalgo and Jones, 1997). 

According to the present invention, the average rates per cell division of de novo 
methylation and of maintenance methylation are dictated by just two measurable variables: S 
30 = the percent of cytosines within CpGs found to be unmethylated by bisulfite analysis of one 
of the two single DNA strands, with prior HpaH digestion; and P = the percent of cytosines 
within CpGs found to be methylated by bisulfite analysis of one of the two DNA strands, 
without prior HpaH digestion. 

The present invention further provides a novel set of equations to calculate the rate of 
35 de novo methylation (n) and the rate of maintenance methylation (m) at a single CpG 
dinucleotide within a HpaH site. 

The present invention further provides a method calculate the rate of de novo 
methylation (n) and the rate of maintenance methylation (m) at a single CpG dinucleotide 
within a HpaH site comprising: determining a value for a variable S and for a variable P, 
40 where S = the percent of cytosines within CpGs found to be unmethylated by bisulfite 
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analysis of one of the two single DNA strands, with prior Hpall digestion, and where P = the 
percent of cytosines within CpGs found to be methylated by bisulfite analysis of one of the 
two DNA strands, without prior Hpall digestion; and calculating the percent of HpaU- 
resistant molecules with double-stranded (full) methyJation (D) as: D=100-2S; determining 
5 the value of H=2SP/(S+D) and F=DP/(S+D) and U=100-H-F, based thereon, whereby the 
rate of de novo methylation (n) at this CpG dinucleotide is defined as the fraction of 
unmethylated post-synthesis CpGs that undergoes conversion to fully methylated CpG per 
cell division: n = l-(U/(U+H/2)), and whereby the rate of maintenance methylation (m) at this 
CpG dinucleotide is defined as the fraction of post-synthesis hemimethylated CpGs that 

10 undergoes conversion to fully methylated CpG per cell division: m = l-(H/(F+H/2)). 

The present invention further provides a hemimethylation detection kit useful for 
measuring the fraction of DNA molecules that is hemi-methylated at a specific CpG 
dinucleotide in a pool of DNA with mixed DNA methylation states at that CpG. 

The present invention further provides a hemimethylation detection kit useful for 

15 measuring the rate of de novo methylation (») and the rate of maintenance methylation (m) at 
a single CpG dinucleotide within a palindromic CpG methylation site. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 A shows the patterns of methylation in ES Cells. The methylation status of 
20 the four CpG sites in individual molecules of DNA (Fragment Cl-f) comprising the 440 bp 
sequence were assessed by cloning of individual molecules followed by MS-SNuPE analysis 
of the four sites as indicated. 

Figure IB shows the methylation patterns of individual DNA molecules within the 
three cell types examined according to the present invention, where N = the number of DNA 
25 molecules assessed. 

Figure 2 shows the detection and quantitation of hemi-methylation in individual 
cloned DNA molecules according to the present invention. The experimental approach used 
to detect hemi-methylation in the ES cells comprises precutting genomic DNA with Hpall, 
followed by bisulfite treatment, then cloning individual PCR products, and assessing the 
30 methylation status of, e.g., site 2 by MS-SNuPE analysis. 

Figure 3 shows the measured steady state methylation levels, methylation rates, and 
an overview of the procedure used to calculate de novo (n) and maintenance (m) methylation 
rates (expressed as the fraction of substrate molecules converted per cell division) at a single 
CpG dinucleotide within a Hpall site, starting with the measurement of two experimental 
35 variables ("P" and "S"). 

Figure 4A summarizes results from such pulse-chase experiments which show that 
DNA synthesized during a 1 hr pulse of BrdU in wild type M1/3A/3B cells was not 
methylated to its final level until 24-48 hr after synthesis. 

Figure 4B shows that the delayed methylation of DNA requires active DNA 
40 methyltransferase enzymes. 
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Figure 5 shows the maturation of methylation pattern cells. The methylation status of 
individual DNA molecules was determined immediately after a 1 hr pulse with BrdU, or after 
a chase period (24 hr) in medium not containing BrdU. 

Figures 6A,B and C illustrate the proposed model for the interactivity between DNA 
methyltransferase enzymes in ES cells. 

Figure 6A depicts how methylation patterns are maintained in wild-type cells 
containing all three DNA methyltransferase enzymes. 

Figure 6B indicates how methylation is proposed to occur in cells lacking both 
Dnmt3a and Dnmt3b enzymes. 

Figure 6C shows the situation in M3A/3B cells, which lack Dnmtl. A mosaic 
methylation pattern is present with patches of methylation and unmethylation and a 
substantial fraction of hemi-methylated sites. 

Figure 7A shows methylation sensitive fingerprints of M1/3A/3B, M3A/3B, and Ml 
ES cell DNA after enzyme digestion with each of Rsal, Rsal with Hpall, or Rsal with Mspl 
for 1 6 h. Bands which appear to be hypomethylated only in lane 2 are indicated by solid 
arrowheads (Class I). Bands which appear to be hypomethylated in both lanes 2 and 3 are 
indicated by open arrowheads (Class II). Functional DNA methyltransferase genes are 
indicated by solid squares and inactivated genes are indicated by open squares. GCP1, 2, 4 
are GC poor primers. 

Figure 7B shows a Southern blot analysis of genomic DNA from M1/3A/3B, 
M3A/3B, and Ml ES cell DNA using isolated AP-PCR fragment Cl-f. 

Figure 7C shows a Ms-SNuPE analysis for fragment CH-d from M1/3A/3B, M3A/3B, 
and Ml ES cell DNA Quantitative methylation analysis of three CpG sites in CH-d. The 
percent methylation represents the average of three site using the following equation on 
Phospholmager quantitation (methylated)C/[(methyIated)C + (unmethylated)T] X 100. CpG 
sites are represented by tick marks; striped boxes represent repetitive sequences; arrows 
represent PCR primers. 

Figure 8 shows the sequence properties of hypomethylated fragments isolated from 
methylation sensitive fingerprints in ES cells (Figure 7A CpG sites are represented by tick 
marks; striped boxes represent repetitive sequences; upward arrows represent HpaE sites. 
♦Indicates bands that are not included in Figure 7A. 

Figure 9 shows the patterns of Methylation in ES Cells at Different Regions. The 
methylation status of individual molecules of DNA from the Cl-f (Class I), CH-d and A- 
repeat regions (Class II) were assessed by cloning of individual bisulfite converted molecules 
followed by Ms-SNuPE or automated sequencing. CpG sites are represented by tick marks. 
Methylation status of CpG dinucleotides: (black wedges/circles) methylated; (white 
wedge/circles) unmethylated. Horizontal rows of wedges/circles indicate individual 
molecules that were sequenced after PCR amplification and cloning of bisulfite-treated DNA. 
The striped region indicated repeat sequences. 
40 Figure 10A shows the distribution of methylation status at Cl-f and CH-d regions, and 
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an overview of the procedure used to calculate distribution of methylation status (folly 
methylated, hemimethylated, and folly unmethylated) at a single CpG dinucleotide within a 
Hpall site, starting with the measurement of two experimental variables ("P" and "S"). A 
description of each variable is given in the first column, along with the equation used to 
5 derive the calculated variables (see Experimental Procedures). The second column illustrates 
the subset of methylation states representing that variable (white box), as well as all the other 
methylation states being assessed in that particular step (gray box). The unmethylated CpGs 
are absent in the case of the S and D variables, since the HpaJI digestion removes these 
molecules from consideration at this step. The data obtained for the three cell lines for the 
10 measured variables "P" and "S" are shown in the columns on the right as percentages, 

followed by the absolute numbers of molecules assessed. Control experiments demonstrated 
that the top and bottom strand are equal in their methylation levels and rates. On average, 
differences between the newly synthesized strand and daughter strand after DNA replication 
should be distributed equally between the top and bottom strand in a large population of cells. 
15 Figure 10B shows the distribution of methylation status of Cl-f and Cll-d. F, 

indicates folly methylated; H, hemimethylated; U, folly unmethylated. 

Figure 1 1 shows the recovery of methylation after 5-Aza-CdR treatment. The 
indicated cell types were treated for 24 h with 3x1 0" 7 M 5-aza-CdR and medium changed and 
the cells propagated further. DNA was extracted at the times indicated after the drug was 
20 added to the cultures and the methylation status of the Cl-f (black) and CH-d fragments 

(white) measured by Ms-SNuPE analysis as indicated. Results given are the average values 
of two separate experiments. 

Figure 12 shows the methylation kinetics oJf newly synthesized DNA at Cl-f and CII- 
d. ES cells containing the genes for the indicated DNA methyltransferases were pulsed for 1 
25 hr with BrdU. The pulse was then removed and fresh medium added to the cells during the 
chase period. The DNA was extracted from the cells at various times after the pulse began 
and immunoprecipitated to isolate the BrdU containing DNA. The methylation status of 
Hpall sites in Cl-f and Cll-d at the indicated time points were determined by quantitative 
MS-SNuPE analysis. The data represent the mean values in two to six experiments and the 
30 error bars indicate the standard deviations. 



DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

35 "Sodium bisulfite treatment" refers to the art-recognized treatment of genomic 

DNA with sodium bisulfite, whereby unmethylated cytosines in the DNA are converted to 
uracil by deamination, and where methylated cytosine residues are conserved (Frommer et 
b1. 9 P.NjL& USA 89:1827-31, 1992). Subsequent PCR amplification replaces the uracil 
residues with thymines and the 5-methylcytosine residues with cytosines. 

40 "GC Content 55 refers, within a particular DNA sequence, to the [(number of C bases 
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+ number of G bases) / band length for each fragment], 

"CpG Island" refers to a contiguous region of genomic DNA that satisfies the 
criteria of (1) having a frequency of CpG dinucleotides corresponding to an 
"Observed/Expected Ratio" >0.6), and (2) having a "GC Content" >0.5. CpG islands are 
typically, but not always, between about 0.2 to about 1 kb in length. 

"Hpall" refers to the art-recognized restriction endonuclease having the palindromic 
"CCGG" (top strand): "GGCC" (bottom strand) recognition motif that comprises two CpG 
dinucleotide sequences (one on each DNA strand) that function as potential methylation sites. 

"Rsal" refers to the art-recognized restriction endonuclease having the "GTAC" (top 
strand): "CATG" (bottom strand) recognition motif. 

"Excess" of a methylation-sensitive restriction endonuclease used to digest DNA 
refers to an amount sufficient, under the reaction conditions, to bring about essentially 
complete digestion of DNA, before bisulfite treatment to ensure accuracy. 

"Methylation state" refers to the presence or absence of 5-methylcytosine ("5- 
mCyt") at one or a plurality of CpG dinucleotides within a DNA sequence. Methylation 
states at one or more particular palindromic CpG methylation sites (each having two CpG 
CpG dinucleotide sequences) within such a DNA sequence include "unmethylated," "fully- 
methylated" and "hemimethylated." 

"Hemimethylation" refers to the methylation state of a palindromic CpG methylation 
site, where only a single cytosine in one of the two CpG dinucleotide sequences of the 
palindromic CpG methylation site is methylated (e.g., 5'-CC M GG-3' (top strand): 3'-GGCC- 
5' (bottom strand)). 

"Hypermethylation" refers to the methylation state corresponding to an increased 
presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a 
test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG 
dinucleotides within a normal control DNA sample. 

"Hypomethylation" refers to the methylation state corresponding to a decreased 
presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a 
test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG 
dinucleotides within a normal control DNA sample. 

"De Novo methylation" refers to the conversion of unmethylated post-synthesis CpG 
dinucleotide sequences (within a palindromic CpG methylation site) to fully methylated CpG 
sequences. 

"Maintenance methylation" refers to the conversion of post-synthesis hemi- 
methylated CpG dinucleotide sequences (within a palindromic CpG methylation site) to fully 
methylated CpG sequences. 

"Methylation assay" refers to any assay for determining the methylation state of a 
CpG dinucleotide within a sequence of DNA. 

"MS.AP-PCR" (Methylation-Sensitive Arbitrarily-Primed Polymerase Chain 
Reaction) refers to the art-recognized technology that allows for a global scan of the genome 
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using CG-rich primers to focus on the regions most likely to contain CpG dinucleotides, and 
described by Gonzalgo et ah, Cancel' Research 57:594-599, 1997. 

"MethyLight" refers to the art-recognized fluorescence-based real-time PCR 
technique described by Eads et al., Cancer Res. 59:2302-2306, 1999. 
5 "Ms-SNuPE" (Methylation-sensitive Single Nucleotide Primer Extension) refers to 

the art-recognized assay described by Gonzalgo & Jones, Nucleic Acids Res. 25:2529-2531, 
1997. 

"MSP" (Methylation-specific PCR) refers to the art-recognized methylation assay 
described by Herman et al. Proc. Natl. Acad. Sci. USA 93:9821-9826, 1996, and by US Patent 
10 No. 5,786,146. 

"COBRA" (Combined Bisulfite Restriction Analysis) refers to the art-recognized 
methylation assay described by Xiong & Laird, Nucleic Acids Res. 25:2532-2534, 1997. 

"MCA" (Methylated CpG Island Amplification) refers to the methylation assay 
described by Toyota et al., Cancer Res. 59:2307-12, 1999, and in WO 00/26401A1. 
15 "DGE" refers to art-recognized 2-dimensional gel electrophoresis (2-DGE) methods. 

Overview 

The present invention provides a novel and straightforward technique to quantitate 
hemimethylation levels. Prior to the development of this method, there had been no accurate 
20 way to determine hemimethylation levels at specific CpG dinucleotides in the genome. 

Additionally, according to the present invention, the steady-state levels of hemimethylation 
and full methylation at a CpG dinucleotide provide sufficient information to allow calculation 
of the rates of de novo methylation and of maintenance methylation. The present invention 
provides a unique set of equations that reflect this concept. This led to the discovery that 

25 cells containing only the Dnmt3a and 3b enzymes show significantly more maintenance 

methyltransferase activity at this site than de novo methyltransferase activity. This contrasts 
with the prior view of these enzymes as predominantly de novo methyltransferases, but 
supports our proposed concept of post-Dnmtl repair maintenance methyltransferases. 

Thus, the present invention provides provides a method to measure the fraction of 

30 DNA molecules that is hemi-methylated at a specific CpG dinucleotide in a particular DNA 
sequence, in a pool of DNA molecules having mixed DNA methylation states at said CpG 
dinucleotide. The present invention also provides for methods to calculate the rates of de 
novo-, and maintenance-methylation per cell division that occur at said CpG dinucleotide. 

The region selected for study was part of the transcribed region of a gene expressed in 

35 mouse mammary and thymus tissue. It was representative of the situation in most of the 

mammalian genome in which CpG islands, frequently located in promoters, are methylation 
free and most of 5-methylcytosine is found in coding regions and repetitive elements (Bird et 
al., 1985; Yoder et al., 1997; Jones, 1999). Because of the complexities of dealing with at 
least three enzymes with overlapping activities, the patterns and kinetics of methylation in 

40 wild-type cells were compared with cells containing either a homozygous mutation of the 
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Dnmtl gene or with a double Dnmt3a/3b knockout. Pulse chase experiments showed that, 
contrary to current assumptions, DNA was not methylated to its final level immediately after 
synthesis, but rather that methylation occurred in two phases. The data are most consistent 
with a model whereby Dnmtl acts soon after replication but leaves gaps of hemimethylated 
DNA which are then filled in by DnmOa and/or 3b, which therefore have roles as 
"methylation repair" enzymes in addition to their potential roles as de novo 
methyltransferases. 
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Experimental Procedures 

ES cell lines. ES cell culture, transfection, and selection were carried out as 
described previously (Li et al., 1992). Jl (M1/3A/3B) is a wild-type ES cell line of inbred 
129/Sv background (Li et al., 1992). The Dnmtr'(M3A/3B), DnmtSa'QAl/SB), DnrntSb^ 
(M1/3A) and \Dnmt3d'\ DnmtJb ^fMl) ES cells were previously described by Okano et al 
(1999). 

Pulse-chase experiments. ES cells were seeded at 9 X 10 s cells per 60 mm dish with 
feeder cells 72 hr before being pulsed for 1 br in 2.5 ml medium containing 1 X 10" 4 M BrdU 
(Sigma). The BrdU-containing medium was then removed, the cells washed once with 
regular medium at 37°C then chased with 4 ml medium supplemented with 1 X 1 0"* M 
thymidine (Sigma). For inhibition studies, the BrdU was also chased with 10" 6 M 5-Aza-CdR 
(Sigma), 0.1 ng/ml Trichostatin (TSA) (Sigma), 5 X 10" 6 M Aphidicolin (Sigma), or 50 ng/ml 
Colcemid (Sigma). Genomic DNA was obtained following lysis with 100 mM NaCl, 10 mM 
EDTA, 1% SDS, and 1 ug/ml Proteinase K and purified by phenol and chloroform 
extractions and ethanol precipitation. The yield and purity of the DNA was determined by 
optical density measurement and 300 ug of DNA digested with 3000 units of Rsal (Roche) 
for 16 hr at 37°C. Digested DNA was purified again by phenol and chloroform extractions, 
ethanol precipitated, and resuspended in 500 ul TE buffer pH 7.5 with 50 ul 10X 
immunoprecipitation buffer (sodium phosphate (pH 7.0)), 0.14 M NaCl, 0.05% Triton X- 
100). The samples were denatured at 95°C for 5 min, cooled on ice for 2 min, and 
immediately mixed with 2.4 ul anti-BrdU antibody (25 ug/ml) per ug DNA (Becton 
Dickinson) (Rideout et al., 1994). After 30 min incubation at room temperature with mild 
rocking, 0.46 ul anti-mouse IgG rabbit (2.6 mg/ml) per ug DNA (Sigma) was added and 
incubation continued for another 30 min at room temperature. The precipitate was collected 
after 5 min centrifugation at 13,000 rpm in a cold microcentrifuge, dissolved in 200 ul TE pH 
7.5 and treated with Proteinase K at 50"C for more than 12 hr. DNA was deproteinized by 
phenol and chloroform extractions and ethanol precipitated. 

MsSNuPE. The Ms-SNuPE technique is a quantitative method for assessing 
methylation differences at specific CpG sites based on bisulfite treatment of DNA, followed 
by single-nucleotide primer extension (Gonzalgo & Jones, Nucleic Acids Res. 25:2529-2531, 
1997). Briefly, genomic DNA is reacted with sodium bisulfite to convert unmethylated 
cytosine to uracil while leaving 5-methylcytosine unchanged. Amplification of the desired 
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target sequence is then performed using PCR primers specific for bisulfite-converted DNA, 
and the resulting product is isolated and used as a template for methylation analysis at the 
CpG site(s) of interest. Small amounts of DNA can be analyzed (e.g., microdissected 
pathology sections), and it avoids utilization of restriction enzymes for determining the 
5 methylation status at CpG sites. Typical reagents (e.g., as might be found in a typical Ms- 
SNuPE-based kit) for Ms-SNuPE analysis may include, but are not limited to: PCR primers 
for specific gene (or methylation-altered DNA sequence or CpG island); optimized PCR 
buffers and deoxynucleotides; gel extraction kit; positive control primers; Ms-SNuPE primers 
for specific gene; reaction buffer (for the Ms-SNuPE reaction); and radioactive nucleotides. 

10 Additionally, bisulfite conversion reagents may include: DNA denaturation buffer; 

sulfonation buffer; DNA recovery regents or kit (e.g., precipitation, ultrafiltration, affinity 
column); desulfonation buffer; and DNA recovery components. 

Methylation-sensitive single nucleotide primer extension (Ms-SNuPE) assays. The 
mean cytosine methylation levels of CpG sites 2, 3 in the fragment were determined by 

15 treatment of DNA (2 jag) with sodium bisulfite according to Frommer et ah (1992). 

Methylation analysis was performed using the methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE) assay (Gonzalgo and Jones, 1997). 
For Examples 1-5. below : 

The sequences of the primers used for bisulfite-treated DNA amplification were as 

20 follows: Top strand: 5' primer (5 ' - AATGTGTAAT ATTTTT ATGGTTTTTTT AGAATGG- 
3') (SEQ ID NO:l), 3' primer (5 '-TTACAAAAAAAATACCTCTTCCTTACTAAAC-3 ') 
(SEQ ID NO:2). Bottom strand: 5' primer (5 ' -TGGT ATTTGGAAAG ATTGTAGGAAAG- 
3') (SEQ ID NO:3) 3* primer (5'-CAATACCTCTATAACCCTTCCAAA-3 , ) (SEQ ID 
NO:4). PCR reactions were performed in 25 \A total volume under the following conditions: 

25 100-200 ng bisulfite treated DNA, 20 mM Tris-HCl (pH 7.5), 2.5 mM MgCl 2 , 100 mM KC1, 
0.5 \xM final concentration of each primer, 200 jaM each of the four deoxynucleotide 
triphosphates, and 0.5 U Taq polymerase (Sigma) complexed with Taq polymerase antibody 
(Clontech). DNA was initially denatured at 94°C for 5 min, 94°C for 1 min, 56°C for 50 sec, 
and 72°C for 1 min for 35 cycles, with a final extension at 72°C for 4 min. PCR products 

30 were gel purified with the Qiaquick Gel Extraction Kit (Qiagen), and the template was 
resuspended in 30 p.1 H 2 0. 

Ms-SNuPE primers', for top strand site 2 (5 '-AATAATTTTGTTTTTTTGGATATT- 
3') (SEQ ID NO:5), site 3 (5 * - AAATTTTGTTTTTGGTTGTAAA-3 ' ) (SEQ ID NO:6), for 
bottom strand site 2 (5 AGGAATAGAATTTGAGATATT-3 ') (SEQ ID NO:7), site 3 (5'- 

35 TAAATTGTTTTAATTAGATT AATAA-3 ') (SEQ ID NO:8). 

Ms-SNuPE reactions. Ms-SNuPE reactions were performed in 10 \x\ total volume 
under the following conditions: 4 \x\ Qiaquick product, 20 mM Tris-HCl (pH 7.5), 2.5 mM 
MgCl 2j 100 mM KC1, 0.5 pM final concentration of each primer, and 1 pCi of either 
[ 32 P]dCTP or [ 32 P]dTTP. Primer extension conditions were: 95°C for 2 min, 50°C for 2 min, 

40 72°C 1 min. The reactions were combined with 4 \x\ stop solution before being denatured at 
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95°C for 5 min and loaded onto a 15% denaturing polyacrylamide gel (7M urea). 
Quantitation of methylation levels was performed on a Molecular Dynamics 
Phosphorlmager. 
For Example s 6-8. helnw 

The PCR primers were designed specifically to amplify only bisulfite converted DNA 
and control experiments showed non-amplification of unconverted DNA with these primers. 
Also, sequencing of PCR products after bisulfite treatment showed less than 1% residual Cs 
at not CpG sites indicating that the assays were valid for methylation status at CpG sites. The 
sequences of the primers used for bisulfite-treated DNA amplification were as follows: 
CLf. (Top strand) 5' primer, 5'-AATGTGTAATATTTTTATGGTTTTTTTAGAATGG-3' 
(SEQ ID NO:l), 3' primer, 5 ' -TTAC AAAAAAAATACCTCTTCCTTACTAAAC-3 ' (SEO 
IDNO:2). V 

£L£ (Bottom strand) 5' primer 5 , -GAGTAAAGATAGAATAAATTGTTTTAATTAG-3 , 
(SEQ ID NO: 16) and 3' primer, 5 '-CAATACCTCTATAACCCTTCCAAA-3 ' (SEO ID 
15 NO:4). 

Old. 5' primer, 5 '-GTTTATAGGTTTAGAGGTTTT-3 ' (SEQ ID NO:17), 3' primer 5'- 
AACACATAAAC CTATTTTAAACTTA-3' (SEQ ID NO:18). 

d=£spmt&. 5' primer, 5 '-TGATTTATTTATTAGAGGTTTTAGG-3 ' (SEQ ID NO:19), 3' 
primer, 3 '-AC ATAAAAAAACAAACTACCC-3 ' (SEQ ID NO:20). 

PCR Conditions. £L£. 95°C for 2 min, 95°C for 1 min, 56°C for 50 sec, and 72°C for 
1 mm for 35 cycles, with a final extension at 72°C for 10 min. CIT-d : 95°C for 2 min, 94°C 
for 1 min, 50°C for 50 sec, and 72°C for 1.5 min for 40 cycles, with a final extension at 72°C 
for 10 min. A-repeats: 95°C for 2 min, 94°C for 1 min, 50°C for 50 sec, and 72°C for 1 min 
for 40 cycles, with a final extension at 72°C for 10 min. PCR products were gei purified with 
the Qiaquick Gel Extraction Kit (Qiagen), and the template was resuspended in 30 ul H 2 0. 

Ms-SNuPE Primers. £L£. 5 '-AATAATTTTGTTTTTTTGGATATT-3 ' (SEQ ID 
NO:5) (Hpan site); QLd: 5 '-TTTTATTTATTGTTATTATGG-3 ' (SEQ ID N021) (site 1) 
S'-GGTATAGTTTGAGTAT-3' (site 2) (SEQ ID NO:22), 5- 
T ATTTTTTAAT AGTATT ATTTTTTAT-3 ' (SEQ ID NO:23) (site 3, HpaH Site). 

Ms-SNuPE reactions were performed in 10 ul total volume under the following 
conditions: 4 ul Qiaquick product, 20 mM Tris-HCl (pH 7.5), 2.5 mM MgCl 2 , 1 00 mM KC1, 
0.5 uM final concentration of each primer, and 1 uCi of either [ 32 P]dCTP or [ 32 P]dTTP. 
Primer extension conditions of Cl-f and Cll-d were: 95°C for 1 min, 46°C for 30 sec, 72°C 
for 20 sec. The reactions were combined with 4 ul stop solution before being denatured at 
95°C for 5 min and loaded onto a 15% denaturing polyacrylamide gel (7M urea). 
Quantitation of methylation levels was performed on a Molecular Dynamics 
Phosphorlmager. 

Bisulfite genomic sequencing by MsSnuPE. Bisulfite Genomic Sequencing by 
Ms-SNuPE or Automated DNA Sequencer. Traditionally patterns are determined by the 
cloning of individual bisulfite treated molecules followed by DNA sequencing. It was more 
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convenient to subject individually cloned molecules to Ms-SNuPE analysis to rapidly assess 
how the four sites were methylated (Cl-f and Cll-d). The PCR products from bisulfite 
converted DNA were ligated into pCRII cloning vector (Invitrogen, San Diego, CA). 
Individual plasmid clones were amplified by Ml 3 primers (forward and backward). The 
5 PCR product was used to sequence by Ms-SNuPE. The condition is the same as above. The 
primers for sequence are for CL£. site 1 (5 '-TATGGTTTTTTTAGAATGG-3 ') (SEQ ID 
NO:9), site 2 (5 '-AATAATTTTGTTTTTTTGGATATT-3 ') (SEQ ID NO:10), site 3 (5'- 
AAATTTTGTTTTTGGTTGTAAA-3 ') (SEQ ID NO:l 1), site 4 (5'- 
TTGTTGTTATGTGTAATTATTTTTT-3 ') (SEQ ID NO:12); and for CII-d (4 sites):5 9 - 

10 TTTTATTTATTGTTATT ATGG-3 ' (SEQ ID NO:24), 5 '-GGTATAGTTTGAGTAT-3 ' 
(SEQ ID NO:25), 5 ' -TTTTTT AATAAGGTTATATTTTTT-3 5 (SEQ ID NO:26), 5- 
TATTTTTTAAT AGTATTATTTTTTAT-3 9 (SEQ ID NO:27). 

A-repeats clones were sequenced by automated DNA sequencer at USC/Norris 
Comprehensive Cancer Center microchemical facility. 

15 5-Aza-CdR Treatments. Cells were plated (2xl0 6 cells/60 mm dish) and treated 24 h 

later with 3x1 0" 7 M 5-Aza-CdR (Sigma). The medium was changed 24 h after drug treatment 
and every subsequent day. DNA was isolated at the indicated days after treatment as 
described (Pfeifer, et al. P.N.A.S. USA 87:8252-8256, 1990). 

Hemimethylation assay. The fact that the enzyme Hpall will not cut the sequence 

20 CCGG in either the fully or hemimethylated configuration followed by bisulfite treatment 
was used to determine whether unmethylated cytosine occurred in the context of a CpG 
sequence in a Hpall insensitive site. Two -4 jug DNA were digested by Rsal and Hpall (1 0 
units per jig) for 16 hr at 37°C and then Hpall added (10 units per fig) for another 2 hr. The 
digests were then analyzed for methylation at site 2 by cloning of individual PCR-amplified 

25 bisulfite treated molecules as described above. 

Methylation Equations. The relative amounts of full methylation, hemimethylation, 
and lack of methylation of a CpG dinucleotide within a Hpall site can be derived from two 
measurable variables: S = the percent of unmethylated cytosines found by bisulfite analysis 
of one of the two single DNA strands, after Hpall digestion; and P = the percent of 

30 methylated cytosines found by bisulfite analysis of one of the two DNA strands, without prior 
Hpall digestion. 

First, the percent of Hpall-resistant molecules with double-stranded (full) methylation 
(D) is calculated as: D^l-28. In this equation, it is assumed that the top and bottom strands 
are equal in their methylation levels and rate {see Fig. 10 for additional support of this). On 
35 average, differences between the newly synthesized strand and daughter strand after DNA 
replication should be distributed equally between the top and bottom strand in a large 
population of cells. 

The amount of hemimethylation (H) can be calculated as: H=2SP/(S+D) and full 
methylation (F) as: F=DP/(S+D). The basis for these two equations is the assumption that the 
40 ratio between hemimethylation and full methylation is constant for a particular DNA sample. 
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Therefore, this ratio, which can be determined from the measurement of S (but which is 
lacking information on unmethylated DNA), can be applied to the measurement of all 
methylation within a single strand (P). Calculation of the percent unmethylated CpGs (U) 
can be calculated from H and F as: U=100-H-F. 
5 Rate calculations. At equilibrium, the percent of DNA molecules that are 

unmethylated (U), hemimethylated on either strand (H), or fully methylated on both strands 
(F) at a CpG dinucleotide contained within a HpaE site, remains constant. Therefore, the 
relative ratios of these three methylation states resulting from each cell division as a 
consequence of the combined effects of de novo methylation and of maintenance methylation 
10 at this CpG should remain constant. If one assumes that de novo methylation of 

unmethylated post-synthesis CpGs results in full, double-stranded DNA methylation, then the 
average rates per cell division of de novo methylation and of maintenance methylation are 
dictated by just two measurable variables: S = the percent of cytosines within CpGs found to 
be unmethylated by bisulfite analysis of one of the two single DNA strands, with prior HpaE 
15 digestion; and P = the percent of cytosines within CpGs found to be methylated by bisulfite 
analysis of one of the two DNA strands, without prior HpaH digestion. 

A novel set of equations was developed to calculate the rate of de novo methylation 
(n) and the rate of maintenance methylation (m) at a single CpG dinucleotide within a HpaH 
site, using just these two measured input variables. First, the percent of J^paH-resistant 
20 molecules with double-stranded (full) methylation (D) as: D=100-2S is calculated. 

From this, H=2SP/(S+D) and F=DP/(S+D) and LM100-H-F can then be calculated. 
The rate of de novo methylation («) at this CpG dinucleotide, is defined as the fraction of 
unmethylated post-synthesis CpGs that undergoes conversion to fully methylated CpG per 
cell division: n = l-(U/(U+H/2)). The rate of maintenance methylation (m) at this CpG 
25 dinucleotide is defined as the fraction of post-synthesis hemimethylated CpGs that undergoes 
conversion to fully methylated CpG per cell division: m = 1 -(H/(F+H/2)). 

Methylation-Sensitive AP-PCR. Methylation-sensitive AP-PCR and isolation of 
fragments of interest were performed as previously described Gonzalgo, et al., Cancer 
Research 57:594-599, 1997; Liang, et al., Genomics 53:260-268, 1998). The following CpG- 
30 poor primers were used for AP-PCR analysis: GCP 1 , 5 , -CACATGGTTCTGC-3 , (SEQ ID 
NO:13); GCP2, 5'-GTCTCTATGACCC-3' (SEQ ID NO: 14); GCP4, 5'- 
CTTACTGTGCCAC-3* (SEQ ID NO:15). The following pairs were used in the AP-PCR 
reaction: GCP1/GCP2, GCP2/GCP4, GCP1/GCP4. 

Southern Blot Analysis of DNA from Cell Lines. Ten ug of DNA from cell lines 
35 were separately digested with either 50 units of Rsal, 50 units each of Rsal and HpalL, or 50 
units each of Rsal and Mspl (Roche) at 37°C for 16 hr. Digested genomic DNA was 
electrophoresed on 0.7% agarose gel and Southern transferred to Zeta-Probe (Bio-Rad) 
membranes overnight. Cloned DNA fragments that were previously isolated from AP-PCR 
polyacrylamide gels were subsequently used as probes for hybridization to these filters 
40 Approximately 100 ng of plasmid insert DNA were Vlabeled by random priming and used 
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' to probe the filters. All hybridizations were performed in 500 mM NaP04 (pH 6.8), 7% 
SDS, 1 mM EDTA (pH 8.0) at 65°C for 16 hr. Membranes were washed twice at 65°C with 
2XSSC/1%SDS followed by three washes with 0.5X SSC/1% SDS at room temperature. 
They were then exposed to autoradiographic film at -80°C. Quantitation of methylation levels 
5 was performed on a Molecular Dynamics Phosphorlmager. 

Methylation Assay Procedures. Various methylation assay procedures are known in 
the art, and can be used in conjunction with the present invention for sequence 
determinations. These assays allow for determination of the methylation state of one or a 
plurality of CpG dinucleotides (e.g., CpG islands) within a DNA sequence. Such assays 
10 involve, among other techniques, DNA sequencing of bisulfite-treated DNA, PCR (for 
sequence-specific amplification), Southern blot analysis, use of methylation-sensitive 
restriction enzymes, etc. 

For example, genomic sequencing has been simplified for analysis of DNA 
methylation patterns and 5-methylcytosine distribution by using bisulfite treatment (Frommer 
15 et al., Proc. Natl Acad. Set USA 89:1827-1831, 1992). Additionally, restriction enzyme 

digestion of PCR products amplified from bisulfite-converted DNA is used, e.g., the method 
described by Sadri & Hornsby (Nucl. Acids Res. 24:5058-5059, 1996), or COBRA 
(Combined Bisulfite Restriction Analysis) (Xiong & Laird, Nucleic Acids Res. 25:2532-2534, 
1997). 

20 COBRA. COBRA analysis is a quantitative methylation assay useful for determining 

DNA methylation levels at specific gene loci in small amounts of genomic DNA (Xiong & 
Laird, Nucleic Acids Res. 25:2532-2534, 1997). Briefly, restriction enzyme digestion is used 
to reveal methylation-dependent sequence differences in PCR products of sodium bisulfite- 
treated DNA. Methylation-dependent sequence differences are first introduced into the 

25 genomic DNA by standard bisulfite treatment according to the procedure described by 

Frommer et al. (Proc. Natl. Acad. Sci. USA 89:1827-1831, 1992). PCR amplification of the 
bisulfite converted DNA is then performed using primers specific for the interested CpG 
islands, followed by restriction endonuclease digestion, gel electrophoresis, and detection 
using specific, labeled hybridization probes. Methylation levels in the original DNA sample 

30 are represented by the relative amounts of digested and undigested PCR product in a linearly 
quantitative fashion across a wide spectrum of DNA methylation levels. In addition, this 
technique can be reliably applied to DNA obtained from microdissected paraffin-embedded 
tissue samples. Typical reagents (e.g., as might be found in a typical COBRA-based kit) for 
COBRA analysis may include, but are not limited to: PCR primers for specific gene (or 

35 methylation-altered DNA sequence or CpG island); restriction enzyme and appropriate 
buffer; gene-hybridization oligo; control hybridization oligo; kinase labeling kit for oligo 
probe; and radioactive nucleotides. Additionally, bisulfite conversion reagents may include: 
DNA denaturation buffer; sulfonation buffer; DNA recovery regents or kit (e.g., 
precipitation, ultrafiltration, affinity column); desulfonation buffer; and DNA recovery 

40 components. 
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Preferably, assays such as "MethyLight" (a fluorescence-based real-time PCR 
technique) (Eads et al., Cancer Res. 59:2302-2306, 1999), Ms-SNuPE (Methylation-sensitive 
Single Nucleotide Primer Extension) reactions (Gonzalgo & Jones, Nucleic Acids Res. 
25:2529-2531, 1997), methylation-specific PCR ("MSP"; Herman et al., Proc. Natl Acad 
Sci. USA 93:9821-9826, 1996; US Patent No. 5,786,146), and methylated CpG island 
amplification («MCA»;Toyota et al., CancerRes. 59:2307-12, 1999) are used alone or in 
combination with other of these methods. 

MethyLight. The MethyLight assay is a high-throughput quantitative methylation 
assay that utilizes fluorescence-based real-time PCR (TaqMan ®) technology that requires no 
further manipulations after the PCR step (Eads et al., CancerRes. 59:2302-2306, 1999). 
Briefly, the MethyLight process begins with a mixed sample of genomic DNA that is 
converted, in a sodium bisulfite reaction, to a mixed pool of methylation-dependent sequence 
differences according to standard procedures (the bisulfite process converts unmethylated 
cytosine residues to uracil). Fluorescence-based PCR is then performed either in an 
"unbiased" (with primers that do not overlap known CpG methylation sites) PCR reaction, or 
in a "biased" (with PCR primers that overlap known CpG dinucleotides) reaction. Sequence 
discrmnnation can occur either at the level of the amplification process or at the level of the 
fluorescence detection process, or both. 

The MethyLight may assay be used as a quantitative test for methylation patterns in 
the genomic DNA sample, wherein sequence discrimination occurs at the level of probe 
hybridization. In this quantitative version, the PCR reaction provides for unbiased 
amplification in the presence of a fluorescent probe that overlaps a particular putative 
methylation site. An unbiased control for the amount of input DNA is provided by a reaction 
m which neither the primers, nor the probe overlie any CpG dinucleotides. Alternatively a 
qualitative test for genomic methylation is achieved by probing of the biased PCR pool with 
either control oligonucleotides that do not "cover" known methylation sites (a fluorescence- 
based version of the "MSP" technique), or with oligonucleotides covering potential 
methylation sites. 

The MethyLight process can by used with a "TaqMan®" probe in the amplification 
process. For example, double-stranded genomic DNA is treated with sodium bisulfite and 
subjected to one of two sets of PCR reactions using TaqMan® probes; e.g., with either 
biased primers and TaqMan® probe, or unbiased primers and TaqMan® probe The 
TaqMan® probe is dual-labeled with fluorescent "reporter" and "quencher" molecules and is 
designed to be specific for a relatively high GC content region so that it melts out at about 10 
°C higher temperature in the PCR cycle than the forward or reverse primers. This allows the 
TaqMan® probe to remain fully hybridized during the PCR annealing/extension step As the 
Taq polymerase enzymatically synthesizes a new strand during PCR, it will eventually reach 
the annealed TaqMan® probe. The Taq polymerase 5' to 3' endonuclease activity will then 
displace the TaqMan® probe by digesting it to release the fluorescent reporter molecule for 
quantitative detection of its now unquenched signal using a real-time fluorescent detection 
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system. 

Typical reagents (e.g., as might be found in a typical MethyLight-based kit) for 
MethyLight analysis may include, but are not limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); TaqMan® probes; optimized PCR 
5 buffers and deoxynucleotides; and Taq polymerase. 

Ms-SNntPE. The Ms-SNuPE technique is a quantitative method for assessing 
methylation differences at specific CpG sites based on bisulfite treatment of DNA, followed 
by single-nucleotide primer extension (Gonzalgo & Jones, Nucleic Acids Res. 25:2529-253 1, 
1997). Briefly, genomic DNA is reacted with sodium bisulfite to convert unmethylated 
10 cytosine to uracil while leaving 5-methylcytosine unchanged. Amplification of the desired 
target sequence is then performed using PCR primers specific for bisulfite-converted DNA, 
and the resulting product is isolated and used as a template for methylation analysis at the 
CpG site(s) of interest. Small amounts of DNA can be analyzed (e.g., microdissected 
pathology sections), and it avoids utilization of restriction enzymes for determining the 
15 methylation status at CpG sites. Typical reagents (e.g., as might be found in a typical Ms- 
SNuPE-based kit) for Ms-SNuPE analysis may include, but are not limited to: PCR primers 
for specific gene (or methylation-altered DNA sequence or CpG island); optimized PCR 
buffers and deoxynucleotides; gel extraction kit; positive control primers; Ms-SNuPE primers 
for specific gene; reaction buffer (for the Ms-SNuPE reaction); and radioactive nucleotides. 
20 Additionally, bisulfite conversion reagents may include: DNA denaturation buffer; 

sulfonation buffer; DNA recovery regents or kit (e.g., precipitation, ultrafiltration, affinity 
column); desulfonation buffer; and DNA recovery components. 

MSP. MSP (methylation-specific PCR) allows for assessing the methylation status of 
virtually any group of CpG sites within a CpG island, independent of the use of methylation- 
25 sensitive restriction enzymes (Herman et al. Proc. Natl Acad. Sci. USA 93:9821-9826, 1996; 
US Patent No. 5,786,146). Briefly, DNA is modified by sodium bisulfite converting all 
unmethylated, but not methylated cytosines to uracil, and subsequently amplified with 
primers specific for methylated versus unmethylated DNA. MSP requires only small 
quantities of DNA, is sensitive to 0.1% methylated alleles of a given CpG island locus, and 
30 can be perfonned on DNA extracted from paraffin-embedded samples. Typical reagents 
(e.g., as might be found in a typical MSP-based kit) for MSP analysis may include, but are 
not limited to: methylated and unmethylated PCR primers for specific gene (or methylation- 
altered DNA sequence or CpG island), optimized PCR buffers and deoxynucleotides, and 
specific probes. 

35 MCA. The MCA technique is a method that can be used to screen for altered 

methylation patterns in genomic DNA, and to isolate specific sequences associated with these 
changes (Toyota et al., Cancer Res. 59:2307-12, 1999). Briefly, restriction enzymes with 
different sensitivities to cytosine methylation in their recognition sites are used to digest 
genomic DNAs from primary tumors, cell lines, and normal tissues prior to arbitrarily primed 

40 PCR amplification. Fragments that show differential methylation are cloned and sequenced 
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after resolving the PCR products on high-resolution polyacrylamide gels. The cloned 
fragments are then used as probes for Southern analysis to confirm differential methylation of 
these regions. Typical reagents (e.g., as might be found in a typical MCA -based kit) for 
MCA analysis may include, but are not limited to: PCR primers for arbitrary priming 
5 Genomic DNA; PCR buffers and nucleotides, restriction enzymes and appropriate buffers; 
gene-hybridization oligos or probes; control hybridization oligos or probes. 

It will be obvious to those skilled in the art that other art-recognized sequencing 
methods, such as direct sequencing, can be employed for purposes of the present invention, 
and that the above-described methylation assays are but examples of such methods that can 
10 be used. 

Kits for Detection of Hemimethylated CpG-containing Nucleic Acid. The present 
invention provides a hemimethylation detection kit for measuring the fraction of DNA 
molecules that is hemi-methylated at a specific CpG dinucleotide in a pool of DNA with 
mixed DNA methylation states at that CpG. The present invention further provides a 
15 hemimethylation detection kit useful for measuring the rate of de novo methylation (n) and 
the rate of maintenance methylation (m) at a single CpG dinucleotide within a HpaH site. 

The reagents required to perform one or more art-recognized methylation assays 
(including the Ms-SnuPE assay identified above) are combined with specific primers or 
probes to determine the hernimethylation state of CpG-containing nucleic acids, or the rate of 
20 de novo methylation (») and the rate of maintenance methylation (m) at a single CpG 
dinucleotide within a HpaH site, according to the present invention. 

For example, the Ms-SnuPE methylation assay could be used alone or in combination 
with other methylation assay methods known in the art, along with specific primers or probes 
to determine the hemimethylation state of a CpG dinucleotide within a particular genomic 
25 sequence. 
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EXAMPLE 1 
Methylation Patterns in ES Cells 

Detailed experiments were conducted on a 440-bp DNA fragment corresponding to a 
mouse EST (BE646975. 1) (Fragment Cl-f; GenBank accession number AK014019) 
expressed in mammary and thymus gland and therefore representative of the coding region of 
a typical mammalian gene in that it was both CpG poor and methylated. A single HpaH site 
and three additional CpG sites (Fig. 1 A) which are residually methylated in the knockout ES 
cells were present in the fragment. The methylation status of the four CpG sites in individual 
DNA strands in cells containing all three Dnmts (M1/3A/3B cells) or containing Dnmtl (Ml 
cells) or Dnmt3a and 3b only (M3A/3B cells) were assessed in detail to investigate potential 
differences in the patterns of methylation. The nomenclature used to describe the cell types 
was selected to focus on which enzymes were present in the cells rather than those which • 
were absent. Minimal differences in methylation levels and patterns were seen in cells 
40 lacking either Dnmt 3a or 3b only. 
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Initially, the methylation patterns at sites 2-4 were determined by cloning of 
individual bisulfite treated molecules followed by DNA sequencing (top fragments in Fig. 
IB). Typically however, the individually cloned molecules were subjected to Ms-SNuPE 
analysis to rapidly assess how the four sites were methylated (Figure 1 A). 
5 Figure 1 A shows the patterns of methylation in ES Cells. The methylation status of 

the four CpG sites in individual molecules of DNA (Fragment Cl-f) comprising the 440 bp 
sequence were assessed by cloning of individual molecules followed by MS-SNuPE analysis 
of the four sites as indicated. Example gels of samples labeled with either dCTP "C" or 
dTTP "T" are shown. The presence of methylation at a given site is indicated by a band in 

10 the "C" lane of the Ms-SNuPE gel whereas a band in the T lane indicates no methylation. 
CpG sites in individual molecules are scored as being methylated (filled boxes) or 
unmethylated as shown (open boxes) so that patterns can be deduced. 

Heterogeneous methylation patterns in M1/3A/3B cells and Ml cells were apparent 
with the four individual sites being methylated with differing efficiencies as indicated in 

15 Figure IB. 

Figure IB shows the methylation patterns of individual DNA molecules within the 
three cell types examined according to the present invention, where N = the number of DNA 
molecules assessed. The data from the upper section within each panel were obtained by 
direct sequencing of fragments containing sites 2 through 4. The bottom sections are for all 

20 four sites. The status of methylation at the sites in individual molecules is read from left to 
right. Figure IB also shows the average percent of methylation at the four sites in the three 
cell types examined. 

Sites 2 and 4 were most methylated on average whereas sites 1 and 3 were less 
efficient methyl acceptors. The methylation pattern in the M3A/3B cells were, however, 

25 completely different in that two classes of molecules were present. The majority of 

molecules in which all four sites were simultaneously assessed were methylation-free (72%), 
14% had only one site methylated but 14% had multiple sites methylated on a single 
molecule. 

This data was most consistent with Dnmt3a/3b being processive or co-operative 
30 enzymes, which tended to methylate sites sequentially although not with 1 00% efficiency, 
since sites 2 and 4 were more methylated than sites 1 and 3. The differential levels of 
methylation at the four sites made it difficult to distinguish between the possibilities that the 
enzymes act processively or rather that methylation of one site increases the probability of 
methylation of adjacent sites. ES cells containing Dnmt3a or 3b as the sole methyltransferase 
35 were not tested, and therefore it was difficult to effectively distinguish between any 

differential specificity between the enzymes. Analysis of the methylation status of the 
cytosines on the paired bottom strands revealed no strand bias in any of the cell types 
analyzed. 

40 EXAMPLE 2 
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A New Assay for the Detection and Quantitation of Hemimethylation 

Bisulfite sequencing has opened the door for the detailed analysis of methylation 
patterns (Frommer et al., 1992), yet it has not heretofore been possible to assess the degree of 
hemi-methylation (i.e., when only a single cytosine in the CpG palindrome is methylated), 
5 because the parental paired DNA strands are separated before analysis. Knowledge of the 
degree of hemi-methylation is crucial to understanding how patterns are copied. 

The fact that the enzyme HpaH will not cut the recognition motif sequence CCGG in 
either the fully or hemi-methylated configuration (Gruenbaum et al., 1981) followed by 
bisulfite treatment was used to determine whether unmethylated cytosine occurred in the 
1 0 context of a CpG sequence in a HpaH-resistant site (see Figure 2). 

Figure 2 shows the detection and quantitation of hemi-methylation in individual 
cloned DNA molecules according to the present invention. The experimental approach used 
to detect hemi-methylation in the ES cells comprises precutting genomic DNA with HpaH, , 
followed by bisulfite treatment, then cloning individual PCR products, and assessing the 
methylation status of, e.g., site 2 by MS-SNuPE analysis. Typical results are shown for these 
MS-SNuPE results in which a signal in the T lanes of example clones indicates hemi- 
methylation with the top strand being unmethylated. The presence of a signal in the "C" lane 
indicates that the site may be either hemi-methylated (i.e., the bottom strand is unmethylated) 
or fully methylated. The distribution between these two scenarios is determined by 
calculation, assuming that there is an equal probability of hemi-methylation of either the top 
or bottom strand. 

Specifically, DNA extracted from the various cell types was digested with an excess 
of Hpan, bisulfite treated and then amplified by PCR with primers flanking the HpaH site 
(site 2). Since the HpaH site is located between the primers, molecules without full or hemi- 
methylation of site 2 would not be amplified and thus be excluded from the analysis. The 
degree of hemi-methylation at a particular site can be accurately determined by averaging 
results obtained by analysis of individual cloned PCR products for the presence of C or T at 
site 2 (for example). Since only the top strand is analyzed in this approach and methylation 
of either the top or bottom strand is possible, the degree of hemi-methylation was calculated 
as indicated in "Experimental Procedures," above. These calculations assume that there is an 
equal probability of hemi-methylation of either the top or bottom strand, a supposition which 
was supported by additional experiments (see Examples 6-8 below). 

This approach allowed for the determination of the distribution of methylation of site 
2 with respect to complete unmethylation, hemi-methylation and full methylation in the three 
35 different cell types (Figure 3). 

Figure 3 shows the measured steady state methylation levels, methylation rates, and 
. an overview of the procedure used to calculate de novo (») and maintenance (m) methylation 
rates (expressed as the fraction of substrate molecules converted per cell division) at a single 
CpG dinucleotide within a HpaH site, starting with the measurement of two experimental 
40 variables ("P" and «S»). A description of each variable is given in the first column, along 
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with the equation used to derive the calculated variables (see "Experimental Procedures," 
above). The second column illustrates the subset of methylation states representing that 
variable (white box), as well as all the other methylation states being assessed in that 
particular step (gray box). The unmethylated CpGs are absent in the case of the S and D 
5 variables, since the HpaU digestion removes these molecules from consideration at this step. 
The data obtained for the three cell lines for the measured variables "P" and "S" are shown in 
the columns on the right as percentages, followed by the absolute numbers of molecules 
assessed. It is assumed that the top and bottom strand are equal in their methylation levels 
and rates. On average, differences between the newly synthesized strand and daughter strand 

10 after DNA replication should be distributed equally between the top and bottom strand in a 
large population of cells. 

As expected, the great majority of sites in wild type cells were fully methylated with 
only a minor proportion of hemimethylated sites (6%). The high percentage (19%) of hemi- 
methylated sites in Ml cells (i.e., containing Dnmtl only), was not anticipated since it was 

1 5 assumed that this enzyme would be highly efficient as a hemi-methylase and would rapidly 
convert half-methylated sites to full methylation following DNA replication. Since there 
were fewer hemi-methylated sites in the wild-type cells containing all three enzymes 
(M1/3A/3B), the data indicate that Dnmt3a/3b are acting as efficient hemi-methylases in 
addition to their roles as de novo methylases. 

20 Figure 3 also shows the rates of de novo and maintenance methylation which could be 

calculated from this data (see "Experimental Procedures," above). Interestingly, Figure 3 
shows that M3A/3B cells had higher rates of hemi-methylase activity (0.37) than of de novo 
methyltransferase activity (0.07). These results indicate that Dnmtl and Dnmt3a/3b acted 
coordinately to ensure a low proportion of hemi-methylated sites in the wild type cells. 

25 

EXAMPLE 3 
Kinetics of Methylation of Newly Synthesized DNA 

The timing of methylation with respect to DNA synthesis was investigated. The 
demonstration that Dnmtl is localized at the replication machinery in the S-phase nucleus 
30 (Leonhardt et al., 1992), and interacts with PCNA (Chuang et al., 1997) has led to the 

expectation that the methylation pattern is copied immediately after synthesis (Araujo et al., 
1998), although earlier studies suggested that some methylation might be delayed so that 
cytosine modification occurred in a biphasic fashion (Adams, 1971; Woodcock et al., 1982, 
Woodcock et al., 1986). 

35 A 1 hr BrdU pulse, followed by chasing for various times was used to investigate the 

timing of methylation with respect to synthesis. DNA containing BrdU was isolated by 
immunoprecipitation (Rideout et al., 1994) and the methylation status of sites 2 and 3 within 
the DNA fragment (see Fig. 1 A for map) measured by quantitative Ms-SNuPE analysis. 

Figure 4A summarizes results from such pulse-chase experiments which show that 

40 DNA synthesized during a 1 hr pulse of BrdU in wild type M1/3A/3B cells was not 

20 

BNSDOCID: <WO 023881 1A2_I_> 



10 



W ° 02/38811 PCT/USOl/47,4, 

methylated to its final level until 24-48 hr after synthesis. ES cells containing the genes for 
the indicated DNA methyltransferases were pulsed for 1 hr with BrdU. The pulse was then 
removed and fresh medium added to the cells during the chase period. The DNA was 
extracted from the cells at various times after the pulse began and immunoprecipitated to 
isolate the BrdU containing DNA. The methylation status of sites 2 and 3 were averaged at 
the indicated time points after determination by quantitative MS-SNuPE analysis. The data 
represent the mean values for the determinations on the top and bottom strands in two to three 
experiments and the error bars indicate the standard deviations. 

Some methylation was therefore delayed with respect to synthesis and was also seen 
in cells containing Dnmtl in combination with either Dnmt3a or 3b, but was less marked in 
Ml cells (Figure 4A). The methylation level of DNA synthesized in M3A/3B cells was 
significantly lower than that in the other cell types and showed little increase until 48 hr after 
synthesis. The data indicated that the post-synthesis methylation in the wild-type M1/3A/3B 
cells might be due to the activities of Dnmt3a and 3b, because the degree of post-synthesis 
15 methylation was less noticeable in the Ml cells, which lacked these two enzymes. 

Figure 4B shows that the delayed methylation of DNA requires active DNA 
methyltransferase enzymes. Cells containing all three DNA methyltransferase enzymes 
(M1/3A/3B) were pulsed for 1 hr with BrdU and DNA collected after 1 hr from some of the 
cultures. The medium in the remaining cultures was changed to medium containing the 
indicated compounds during the chase period and DNA collected 24 hr after the beginning of 
the experiment. BrdU-containing DNA was immunoprecipitated and the level of methylation 
in this DNA determined by quantitative MS-SNuPE analysis. 

The delayed methylation on the newly synthesized DNA strand in the wild-type 
M1/3A/3B cells required the presence of active Dnmts, because treatment with 5-aza-2>- 
deoxycytidine at the time the BrdU pulse was removed resulted in no further methylation of 
the pulsed strand (Figure 4B). Because the drug forms a covalent bond with Dnmts, it 
induces a rapid "transient knockout" of active enzymes (Taylor and Jones, 1982). 

Figure 4B also shows that the maturation process was not blocked by drugs which 
inhibit histone deacetylase (Trichostatin A, "TS A"), DNA synthesis (Aphidicolin) or mitosis 
30 (Colcemid). 

Thus, active methyltransferases but not histone acetylation, DNA synthesis or mitosis 
were necessary for the delayed methylation to occur. 
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EXAMPLE 4 
Patterns of Methylation of Newly Synthesized DNA 
The above experiments yielded information of the average methylation levels of two 
of the four sites with respect to time after synthesis. Individual DNA strands collected at 1 hr 
and 24 hr after BrdU pulse and chase, respectively, were cloned to see how the patterns 
evolved on a molecule by molecule basis by plotting the number of sites concurrently 
methylated per strand at 1 hr and 24 hr (Figure 5). 
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Figure 5 shows the maturation of methylation pattern cells. The methylation status of 
individual DNA molecules was determined immediately after a 1 lir pulse with BrdU, or after 
a chase period (24 hr) in medium not containing BrdU. DNA collected from these two time 
points was immunoprecipitated to isolate BrdU-containing DNA, and the methylation status 
5 of individual molecules determined to assess the methylation density of these molecules as a 
function of time after synthesis as in Figure 1 . Data are given as the percentage of individual 
molecules containing between 0 and 4 sites concurrently methylated on an individual 
molecule. 

Surprisingly, DNA collected immediately after the 1 hr pulse from M1/3A/3B cells, 
10 showed that a considerable proportion (38%) of molecules contained none or only one of the 
four sites methylated and a relatively even distribution of methylation densities. The pattern 
shifted substantially in the 24 hr chase period with most of the molecules now acquiring 
concurrent methylation at three or four sites. Newly synthesized DNA in Ml cells had 40% 
of molecules with zero or one site methylated, 9% having two sites and 50% having three or 
15 four sites methylated. Again, this pattern shifted during the chase period, but the mode of the 
density distribution was two sites methylated per molecule as compared to the three in 
M1/3A/3B cells. 

Thus, newly synthesized DNA contained unmethylated "patches" in both cell types as 
manifest by the occurrence of molecules with all four sites simultaneously unmethylated 
20 which were very much reduced in the post-synthesis phase. This reduction was most likely 
achieved by Dnmt3a and/or 3b, in the wild type cells since a less striking change occurred in 
the Ml cells which lacked these enzymes. 

The same pattern analysis was not performed on the M3A/3B cells, because the 
majority of molecules (72%) were unmethylated at equilibrium (see Figure 1), making it 
25 technically burdensome to clone and assess a large enough number of them for meaningful 
interpretation. 

Example 5 

Subcellular Localization of DNA Methyltransferases 

30 If the discontinuity observed in the kinetics of post-synthetic restoration of DNA 

methylation levels was indeed attributable to distinct methyltransferase activities, with 
Dnmtl mainly responsible for maintenance activity immediately following DNA synthesis 
and Dnmt3a and/or 3b being active later in the cell cycle, then Dnmt3a and/or 3b would not 
be expected to be as strongly associated with toroidal replication foci, as has been observed 

35 for Dnmtl. 

The subcellular localization of the three methyltransferases was determined by the 
transfection of plasmids containing the three genes fused to green fluorescence protein (GFP) 
into mouse C3H 10T1/2 C18 cells. Cells were double stained with an antibody to PCNA 
labeled with Texas red and examined by confocal microscopy 48-72 hr after transfection. 
40 Cells transfected with DNMT-1 confirmed earlier studies that this protein often co-localized 
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with PCNA, particularly in toroidal structures presumably representing replication machines 
at late S phase (Leonhardt et al., 1992; Chuang et al., 1997; Rountree et al., 2000). 

Dnmt3a and 3b were localized almost entirely to the nucleus and were more diffusely 
present in nuclei containing these toroids. Interestingly, Dnmt3b, unlike Dnmt3a showed 
5 some co-localization with the PCNA-containing toroids. 

Thus, Dnmt3b, but not Dmnt3a may associate with replication machines. 

EXAMPLE 6 

(Genome scanning was used to identify sequences to investigate methylation patterns) 

) The findings of Examples 1-5 (above) were extended. Specifically, a genome 

scanning approach was used to investigate the patterns of methylation in the various 
knockout ES cells in CpG poor and CpG rich regions to determine the roles of the enzymes in 
carrying out the bulk of methylation in mouse embryonic stem (ES) cells. 

The methylation levels of CpG-poor sequences were, in general, uniforrnly reduced in 
Dmntl-deficient cells. However, there was considerable variability among different regions 
in the efficiency with which DNA methylation was retained in Dnmt3a/3b-deficient cells 
indicating a sequence preference of the Dnmtl enzyme. 

MS AP-PCR was used to fingerprint the methylation patterns in a panel of ES cells 
containing different combinations of DNA methyltransferases (Dnmtl (-/-) or Dnmt3a (-/- 
)/3b (-/-)). The MS AP-PCR method allows for a methylation partem to be easily obtained 
and relies on the differential susceptibilities of unmethylated and methylated CCGG sites to 
cutting by the enzyme HpaH giving a valid fingerprint of the methylation status of CpG 
islands. The purpose and advantage of Ms AP-PCR was to perform a rapid and global screen 
of genome which would then allow identification of representative sequences for more 
detailed analysis. We initially focused on CpG poor regions of DNA in the current work, 
since these are the regions of DNA in which the majority of 5-methylcytosine is found. 

Figure 7A shows an example of an analysis of DNA extracted from the various cell 
types using CpG poor primers to target regions of DNA not located in CpG islands. The 
nomenclature used for the cells was selected to highlight which gene products were active in 
the cells rather than those which were absent. Analysis of the fingerprints showed a uniform 
loss of methylation at most evaluable bands in cells which contained Dnmt3a and 3b only 
(M3 A/3B), as compared to the other cell types examined (see lanes 2 and bands indicated by 
filled in arrows). The decreased intensities in the Rsa/HpaH digests of DNA from these cells 
suggested a generalized decrease in methylation in cells without a functional Dnmtl enzyme 
and a lack of marked sequence specificity of the retained Dmnt3a and 3b, since no bands 
were apparent which were completely methylated or unmethylated relative to the wild-type or 
mutant cell types. 

An important distinction was observed in cells which contained Dnmtl only (Ml 
cells, lane 3). Their methylation pattern was similar to that of the wild-type cells for many of 
the bands. However there were 6/12 evaluable bands present in which clearly decreased 
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methylation was apparent (indicated by open arrows). This contrasted to the situation in the 
M3A/3B cells in which there was a generalized decrease in all sequences examined. Since 
the AP-PCR method is only semiquantitative, the level of methylation of some of the bands 
with decreased intensities on AP-PCR gels were assessed using either Southern blot analysis 
5 or Ms-SNuPE analysis (Figures 7B and 7C). Results confirmed the decreases apparent from 
the AP-PCR gels, thus validating the approach. 

There were, therefore, two classes of fragments visible in the fingerprints. We 
defined Class / sequences as those that had decreased methylation in M3A/3B cells (z.e., 
lacking Dnmtl), but close to normal methylation levels in Ml cells. Class II sequences, on 

10 the other hand showed loss of methylation in both Ml and M3A/3B cells. 

Figure 8 shows the sequence properties of a selection of these two classes of 
fragments with respect to the occurrence of the CpG dinucleotides and the presence of 
repetitive elements. Class I sequences tended to have fewer repetitive elements, and a slightly 
lower CpG density than Class II sequences. Clearly, this distinction was not absolute since 

15 there were no obvious differences with respect to these properties such as the fragments Cl-b 
and Cll-a, or Cl-a and Cll-b. 



Methylation Patterns on Individual DNA Molecules. The analyses described above 
provided insight into the levels of methylation at individual CpG dinucleotides, but they did 

20 not yield information on the patterns of methylation at these regions. The sequencing of 

cloned PCR products of bisulfite-treated genomic DNA can reveal the patterns of methylation 
in individual DNA molecules. Therefore, such an analysis was performed on one of the 
Class I sequences, and two of the Class II sequences. These experiments clearly illustrated 
the differences in methylation levels in the different knockout cells (Figure 9). 

25 The 440-bp fragment Cl-f sequence (GenBand Accession number AK014019) 

showed extensive methylation of all four sites sequenced in the wild-type cells and extensive 
methylation also in the Ml cells but a marked decrease in M3A/3B cells on both the top and 
bottom strands, which had approximately equal methylation levels at the CpG sites in all cell 
types. Analysis of the top strand of individual molecules in the M3 A/3B cells showed a 

30 strong asymmetry of patterns in that 72% of molecules were completely unmethylated, 14% 
had only one site methylated, but 14% had multiple sites methylated on a single molecule. 
The differential levels of methylation at the four sites made it difficult to distinguish between 
the possibilities that the enzymes act processively, or rather that methylation at one site 
increases the probability of methylation of adjacent sites. 

35 Figure 9 also shows the dramatic decrease in the level of methylation of the Cll-d 

fragment containing an LTR/MALR element in both the Ml and M3A/3B cells. There was 
substantial methylation in the wild-type cells but virtually no methylation in the Ml cells and 
low levels of methylation in M3 A/3B cells. However the data clearly showed that there was 
little methylation of the four sites in this fragment in the Ml cells, in contrast to the Cl-f 

40 fragment, which still showed substantial levels of methylation in Ml cells. A similar 
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situation was also observed in the A-repeats (i.e., the promoters) of LINE-1 repetitive 
elements previously shown to be methylated in M3A/3B cells. Again, virtually no 
methylation occurred in Ml cells and slightly higher levels of methylation were seen in 
M3A/3B cells. 

5 Since Dnmtl is thought to be the major cytosine-5 DNA methyltransferase in 

mammals, responsible for most, if not aU maintenance activity in the cell, the identification of 
a class of sequences that appeared to be poorly maintained by Dnmtl alone, was interesting. 
DnmOa or 3b did not seem to be independently responsible for the methylation of these sites, 
since these sequences were also substantially demethylated in M3A/3B cells (Figures 7 and 
1 0 9). Furthermore, the effects of Dnmtl and Dnmt3a/3b did not appear to be merely additive 
(Figure 9), as would be the case if both Dnmtl and Dnmt3a/3b were independent participants 
in the maintenance and de novo methylation of these sequences. Rather, methylation of these 
sequences appears to require cooperation both Dnmtl and Dnmt 3a/3b activities. The most 
straightforward interpretation of these data is that Dnmt3a and/or 3b are necessary for 
1 5 ongoing de novo methylation of these sequences to compensate for poor maintenance 
methylation by Dnmtl. If either of these activities is absent, the methylation of these 
sequences cannot be properly maintained. 
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EXAMPLE 7 

(A new assay for the detection of hemi-methylation; Poor maintenance 
methylation was balanced by a continuing high rate of de novo methylation mediated by 

Dnmt3a and/or Dnmt3b) 

One of the sequences that was poorly maintained by Dnmtl alone was further 
investigated, and showed that it had a surprisingly high level of hemi-methylation, even in 
wild-type cells, suggesting poor maintenance methylation balanced by a continuing high rate 
of de novo methylation mediated by Dnmt3a and/or Dnmt3b. This study used the inventive 
hemi-methylation assay, described herein above. Prior to the development of this novel and 
straightforward method there had been no accurate way to determine hemimethylation levels 
at specific CpG dinucleotides in the genome. 
30 Further evidence that Dnmt3a and/or Dnmt3b are responsible for the compensating de 

novo methylation was provided by the fact that these enzymes could restore methylation to 
pretreatment levels following transient exposure of cells to 5-Aza-CdR, whereas Dnmtl 
could not. 

Dnmtl by itself was also found herein to be incapable of restoring methylation of 
sequences that it had been able to maintain prior to the 5-Aza-CdR treatment, suggesting that 
its ability to de novo methylation is dependent on the presence of a critical level of pre- 
existing methylation at CpG sites. 

A New Assay for the Detection of Hemi-methylation. One prediction of poor 
40 maintenance methylation of Class n sequences, offset by continuing de novo methylation is 
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that the level of hemi-methylation at these sites (i.e., when only a single cytosine in the 
palindrome is methylated) should be substantially higher than in sequences with low levels of 
de novo methylation and very efficient maintenance methylation following each round of 
DNA replication. 

5 Testing of this prediction required the development of a way to measure levels of 

hemi-methylation at individual CpG dinucleo tides. The present invention provides such an 
invention as described herein under Examples 1-5 {see Figures 2 and 3). The assay was 
expanded to cover additional class I and class II genomic sequences. 

Figure 10 shows the results of the hemi-methylation analysis of a Class I sequence 

10 (Cl-f) and of a Class II sequence (CH-d). There was a surprisingly high level of hemi- 
methylation of Cll-d sequences in wild-type (M1/3A/3B) cells, consistent with the 
interpretation that the methylation of these sites is poorly maintained by Dnmtl, and requires 
ongoing de novo methylation mediated by Dnmt3a and/or Dnmt3b. Hemi-methylation levels 
of Cl-f were substantially lower, as would be expected for a site that is subject to efficient 

1 5 maintenance methylation by Dnmtl . Control experiments of this sequences in M3A/3B cells 
demonstrated that equal amounts of hemi-methylation were present on top and bottom strands 
(Fig. 5). 

Cooperativity Between Enzymes after 5-Aza-2 '-deoxycytidine Treatment. Treatment 

20 of dividing cells with the DNA methyltransferase inhibitor 5-aza-CdR results in a transient 
demethylation of the genome, followed by a rebound of methylation levels in subsequent cell 
divisions. Since such rebound methylation requires de novo methylation activity, this type of 
analysis can provide an independent assessment of the de novo methylation capabilities of the 
various DNA methyltransferase enzymes at particular regions. This could lend further 

25 support to the proposition that Dnmt3a and/or Dnmt3b are responsible for ongoing de novo 
methylation of Class II sequences. 

The knockout cells were, therefore, exposed to 5-aza-CdR and followed the kinetics 
of remethylation of the Cl-f and Cll-d fragments were followed to determine how the Dnmt's 
interacted to restore methylation levels after perturbation of the equilibrium in the ES cell 

30 types (Figure 11). The results showed that methylation levels reached their minimum three 
days after drug treatment in wild type, and that the methylation of both fragments was 
restored to near pre-treatment levels by 14 days after treatment. Figure 1 1 also shows that the 
combination of Dnmt3a with 3b (M3A/3B) was sufficient to result in the rapid remethylation 
of the Cll-d but not the Cl-f fragment. 

35 Surprisingly, the Dnmtl enzyme by itself failed to restore the methylation of 

either fragment after transient drug treatment (Figure 1 1). The Ml cells showed a continued 
loss of methylation after treatment, suggesting that once the equilibrium had been perturbed 
below a certain level by 5-aza-CdR the Dnmtl by itself or an as yet unidentified 
methyltransferase had insufficient de novo methylation activity for the restoration of 

40 methylation levels to pre-treatment levels. In contrast, the combined activities of Dnmt3a 
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and Dnmt3b resulted in the full restoration of pretreatment levels of methylation at the Cll-d 
sequence. These results suggest that the Cll-d sequence is an efficient target for de novo 
methylation by Dnmt3a and/or Dnmt3b, but not by Dnmtl . This fits a model in which 
ongoing de novo methylating activity of Dnmt3a and/or 3b compensate for poor maintenance 
of this sequence by Dnmtl. 



EXAMPLE 8 

(Dnmt3a and/or Dnmt3b methylation occured close to the time of DNA replication, 
while Dnmtl showed a substantial amount of delayed methylation) 

Methylation by Dnmt3a and/or Dnmt3b occurs close to the time of DNA replication, 
while Dnmtl shows a substantial amount of delayed methylation, extending beyond one hour 
post DNA synthesis. However, this delay in maintenance methylation by Dnmtl was not 
responsible for the sequence-dependent variability in methylation levels in Dnmt3a/3b- 
deficient cells, since both types of sequences showed this maintenance methylation delay. 
We conclude that the major distinction between sites that are well maintained by Dnmtl, 
versus those that are not, lies in the efficiency of post-replicative maintenance methylation 
efficiency by Dnmtl, rather than in a difference in de novo methylation, or in delayed 
maintenance methylation. 



Kinetics of Methylation of Newly Synthesized DNA. The balance of poor maintenance 
methylation of Class II sequences, with a compensating de novo methylation by Dnmt3a 
and/or Dnmt3b raises the question whether both of these methylation activities occur 
concurrently immediately post replication, or whether the de novo methylation occurs at 
some other time in the cell cycle. A 1-hr BrdU pulse followed by chasing for various times 
was used to investigate the timing of methylation with respect to synthesis of the two DNA 
sequence classes. 

DNA containing BrdU was isolated by immunoprecipitation using standard methods, 
and the methylation status of CpG sites within the fragments measured by cloning of 
individual DNA molecules and quantitation of methylation levels or by Ms-SNuPE analysis. 

Figure 12 summarizes results from such pulse-chase experiments which show that 
DNA synthesized during a 1-hr pulse of BrdU in wild-type (M1/3A/3B) cells was not 
methylated to its final level immediately after synthesis but that some further methylation 
occurred in the post-synthetic phase measured at 3.5 and 24 nr. However, this did not appear 
to be de novo methylation attributable to Dnmt3a and/or Dnmt3b, since M3A/3B cells did not 
show additional methylation after the first hour. Indeed, the delayed increase in methylation 
was also evident for the Cl-f sequence in Ml cells. Prior demonstration that Dnmtl is 
localized to the replication machinery in the S-phase nucleus and interacts with PCNA has 
led to the expectation in the art that the methylation pattern is copied immediately after 
synthesis. However, some earlier prior art studies have suggested that some methylation 
might be delayed, with cytosine methylation occurring in a biphasic fashion. The present 
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results are consistent with these earlier studies. 

In conclusion, most methylation by Dnmt3a and/or Dnmt3b occured immediately 
before or soon after DNA synthesis, while Dnmtl continued to display maintenance activity 
more than 1 hour post replication. 

5 

Summary 
(Examples 1-8) 

The functions of the three known DNA methyltransferases Dnmtl, Dnmt3a and 3b in 
establishing and maintaining methylation patterns are not fully understood. According to the 

10 present invention, mouse embryonic stem cells (ES) with systematic gene knockouts for 

Dnmtl, Dnmt3a and 3b were used to show that DNA methylation in a CpG-poor transcribed 
sequence occurs in two phases with respect to DNA synthesis. BrdU pulse chase 
experiments showed that Dnmtl functioned at or soon after replication to methylate the 
newly synthesized strand to approximately 72% of the equilibrium level, but leaves patches 

15 of hemimethylated DNA. The restoration of the pattern was significantly facilitated by 

Dnmt3a and 3b which appear to be processive enzymes whose activities fill in the patches in 
addition to their roles as de novo methylases. The present invention provides novel methods 
to measure hemi-methylation and to calculate methylation rates. These novel methods were 
used herein to conclude that Dnmt3a and 3b act mainly as maintenance methyltransferases to 

20 ensure the completeness of methylation of the genome in ES cells. 

The central and causative roles of DNA methylation in developmental processes such 
as imprinting and in pathological situations such as cancer, ICF and Rett syndromes are 
increasingly being appreciated. This has stimulated research to understand how methylation 
patterns are acquired and maintained. The recent cloning and preliminary characterization of 

25 the Dnmt3a and 3b enzymes has led to the suggestion that they might be primarily 

responsible for de novo methylation by establishing a pattern which is copied by the better 
characterized Dnmtl. 

According to the present invention, Dnmt3a and 3b act as de novo enzymes, and 
likely have more important roles in non-differentiating cells may be to assist Dnmtl in 

30 ensuring full maintenance of methylation patterns. 

According to the present invention, the methylation of a region of DNA, typical of a 
mammalian gene with respect to the paucity of CpG, occurred in two phases and therefore 
discontinuously (summarized in the model shown in Figure 6A, B and C). 

Figures 6A, B and C illustrate the proposed model for the interactivity between DNA 

35 methyltransferase enzymes in ES cells. This figure represents a highly schematic 

interpretation of the data of the present invention. Although the data is based on the analysis 
of four contiguous CpGs, and of one CpG dinucleotide in particular detail, the conclusions 
have been summarized using a hypothetical replication fork in which evenly spaced CpG 
dinucleotides are represented by circles. Black circles indicate methylated CpGs on the 

40 parent DNA strands, while grey circles represent newly-methylated CpGs on the daughter 
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strand. Unmethylated CpGs are indicated by white circles. PCNA is indicated by a ring. 
DNA methyltransferases are indicated by rectangles (Dnmtl) or ovals (Dnmt3a and/or 3b). 

Figure 6A depicts how methylation patterns are maintained in wild-type cells 
containing all three DNA methyltransferase enzymes. Most DNA synthesized in wild-type 
cells is likely immediately methylated after synthesis by Dnmtl which is presumably closely 
associated with the replication machinery through its interaction with PCNA There are 
regions of DNA, in which patches of hemimethylation occur, and these appear to serve as 
substrates for the DnmOa and 3b enzymes The full copying of the pattern requires the 
concerted efforts of all three enzymes. 

Figure 6B indicates how methylation is proposed to occur in cells lacking both 
Dnmt3a and Dnmt3b enzymes. Individual CpGs, or patches of CpGs missed by Dnmtl 
remain unmethylated, since Dnmt3a and Dnmt3b are lacking. This results in a higher 
incidence of unmethylated CpGs surviving to the next round of replication. Some of these 
CpGs are methylated in the next round of replication by Dnmtl, so that by 24 hours, there are 
relatively few large patches of unmethylated CpGs in Ml cells (see Figure 5). 

Figure 6C shows the situation in M3A/3B cells, which lack Dnmtl. A mosaic 
methylation pattern is present with patches of methylation and unmethylation and a 
substantial fraction of hemi-methylated sites. It is likely that in the absence of Dnmtl, 
Dnmt3a and/or 3b are capable of some copying of the existing methylation pattern, but are 
not particularly efficient. Furthermore, the enzymes also seem capable of a de novo 
modification of both strands of DNA thus resulting in a mosaic pattern of methylation within 
these cell types. 

The first phase of methylation, in which CpG sites are methylated to approximately 
72% of their final levels, is most likely accomplished by Dnmtl, which has been shown to be 
associated with DNA replication machines by the elegant work of Leonhardt et al. (1992) and 
Rountree et al. (2000) and confirmed here. 

Significantly, the enzyme left gaps of unmethylated sites linked to each other as 
shown by the direct sequencing of BrdU pulse labeled DNA. The size of these gaps or 
patches was not established, but they were at least 300 bp which is the distance between sites 
1 and 4 in me fragment studied herein. This DNA contained a substantial number of 
hemimethylated sites, which were converted into fully methylated sites during the post 
synthetic phase. The delayed methylation was most likely accomplished by Dnmt3a/3b, 
which must therefore be acting as hemimethylases which seemed to be processive, yet not 
100% efficient, enzymes. 

The evidence for this processivity came from analysis of the M3A/3B cells which 
lacked Dnmtl in which 72% of the molecules analyzed had no methylation whereas 14% had 
more than one site methylated. Putting these two properties together probably assures the 
stability of the pattern. However, it is also possible that the enzymes are not processive but 
rather that the presence of methylation attracts further methylation to the region. 

Novel Hemi-methylation Assay. The present invention provides a novel method for 



29 



WO 02/38811 PCT/US01/47141 
the quantitation of hemi-methylation, which requires complete digestion of DNA with Hp all 
before bisulfite treatment to ensure accuracy. This was accomplished by the use of an excess 
of enzyme, and replenishment of enzyme after 16 hr incubation. The method is robust and 
has general utility in other experimental and applied applications. 
5 The occurrence of a high percentage of hemimethylated sites in the M3 A/3B cells 

seemed, at first sight, to not fit with the suggestion that the Dnmt3a and 3b enzymes acted as 
hemi-methylases to fill in gaps in wild-type cells. This apparent paradox could be explained 
if the overall level of cytosine methyltransferase activity in the M3 A/3B cells was inadequate 
for effective methylation maintenance. This may indeed be the case since the combined rates 
10 ofde novo and of maintenance methylation present in M3A/3B cells (0.07 + 0.37 = 0.44) was 
substantially lower than that of Ml cells (0.39 + 0.75 = 1.14) or of wild-type cells (0.50 + 
0.94 = 1.44). Okano et al. (1998) found that Dnmt3a and Dnmt3b had substantially lower 
levels of activity than Dnmtl in the test tube. The pattern in the M3A/3B cells is therefore 
highly mosaic with a significant percentage of hemimethylated patches which the Dnmt3a 
15 and 3b are unable to restore to a fully methylated configuration. 

The overall rates described herein above were calculated from the measured steady- 
state levels of DNA methylation in cells with differing complements of DNA methylases. 
Nevertheless, the resulting rates appear to be mutually consistent, and suggest relatively* little 
overlap in function. For instance, the combined de novo and maintenance methylation rates 
20 calculated for Ml cells (1.14) added to the combined de novo and maintenance methylation 
rates calculated for M3A/3B cells (0.44) equaled 1 .58. This is not far from the calculated 
overall activity for M1/3A/3B cells of 1.44 suggesting that the activities are largely 
complementary rather than overlapping. 

The equations of the present invention assume that all de novo methylation leads to a 
25 fully methylated product. Additionally, calculation methods were derived that consider de 

novo methyltransferase activity resulting in hemimethylation. However, the inclusion of such 
activities for purposes of the present invention leads to a less optimal fit with the 
experimental data. Additionally, experimental evidence from prokaryotic methyltransferases 
indicates that de novo methyltransferases act sequentially on the two opposing DNA strands, 
30 with the methylation of the cytosine on the first strand being rate-limiting. 

The equations of the present invention assume that the top and bottom strands are 
equally methylated, a supposition supported in several preliminary experiments that were 
performed. 

Calculations of the rates of hemimethylation, according to the present invention, 
35 assumes that the methylation state of a DNA strand reaches the measured equilibrium level 
within a single cell division. In reality, the BrdU data suggested that, although DNA strands 
acquired most of their methylation within the first cell division, they continued to acquire 
additional methylation as they aged with each cell division. 

Therefore, a pool of DNA molecules from a population of cells is comprised of many 
40 recently synthesized strands with DNA methylation levels at, or slightly below the average 
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level for the entire population, as well as older DNA strands with higher levels of 
methylation. The relative prevalence of a DNA strand in the population decreases as a 
function of the number of rounds of DNA replication since its initial synthesis. These 
"strands" may represent long patches, rather than entire molecules (chromosomes), since 
recombination and repair-associated DNA synthesis could disrupt their contiguity. 

The model, according to the present invention, supports a mechanism, whereby DNA 
strands continue to become increasingly methylated as they age. As a consequence, the rate 
calculations of the present invention should be viewed as weighted averages for the entire 
population of DNA strands of different age. Any de novo methylation that occurs on aging 
DNA strands could not be removed passively by DNA replication so that the only way to 
counteract this cumulative methylation would be to invoke active, enzymatic demethylation. 
Although these aging strands represent an increasingly small fraction of the population, they 
could have biological consequences if they reach a threshold of DNA methylation that results 
in a phenotypic effect, such silencing a tumor-suppressor gene, leading to clonal outgrowth. 
This previously unrecognized phenomenon may explain, in part, the increase in CpG island 
hypermethylation observed in aging (Ahuja and Issa, 2000) and in cancer (Jones and Laird, 
1999). 

In Ml cells lacking DnrnOa and 3b, there was clear evidence for two classes of 
molecules with 50% of molecules newly synthesized during the 1-hr BrdU pulse having 2 or 
less sites concurrently methylated and the remainder containing 3 or 4 sites methylated (Fig. 
5). The biphasic nature of the distribution was consistent with the idea that the leading and 
lagging strands were differentially handled by the methylases, although this has not been 
found to be the case with replication origins (Araujo et al., 1998). It was, however, 
interesting that the Dnmt3a and 3b enzymes showed differential nuclear localization, and 
further work will be necessary to determine whether the methylation kinetics of the leading 
and lagging strands differ. 

The interpretation of data obtained according to the present invention relies to a 
significant extent on the use of BrdU to tag the newly synthesized DNA for subsequent 
analysis. It is therefore important to question whether the incorporation of this molecule into 
30 DNA might alter its post-synthetic methylation patterns. This is unlikely since early 

experiments showed directly that BrdU does not cause gross changes in methylation levels 
even at levels where 95% of thymines had been substituted by the analog (Singer et al 
1977). 

Significantly, the methylation properties of a non-CpG island region of DNA was 
35 measured. Although most genomic methylation occurs in such sequences, it will be 

important to see whether the same cooperation between the enzymes is evident with CpG 
islands where modification is more tightly associated with promoter silencing. In this 
regard, it was determined that methylation of repetitive sequences was not well maintained in 
ES cells lacking Dnmt3a and 3b, indicating that at least two methyltransferases are needed to 
40 ensure methylation of these sequences, unlike the fragment studied herein which was 

31 



25 



BNSDOCID: <WO OPTWft11A? 1 > 



WO 02/3881 1 PCT/US01/47141 

substantially methylated in Ml cells. Also, replication origins in mammalian cells have been 
shown to become methylated within 5 min after synthesis (Araujo et al., 1998; Rein et aL, 
1999), suggesting that they may not show the delay apparent with the fragment we are 
studying. 

5 The results of the present invention are strikingly reminiscent of complementary 

activities of other enzyme systems involved in DNA metabolism. For example, DNA 
polymerases a and 5 participate at different stages of DNA synthesis (Waga and Stillman, 
1998). Perhaps the methylation pattern is maintained by enzymes with overlapping yet 
specialized activities. 

10 The data of the present invention shows that an important function of Dnmt3a and 3b 

in embryonic cells is to fill in patches of unmethylated DNA missed by Dnmtl at the 
replication fork. These enzymes therefore have a repair function in addition to their potential 
roles as de novo methyltransferases. 

Finally, Examples 6-8 show that mouse ES cells with systematic gene knockouts for 

15 DNA methyltransferases (Dnmts) can be used to delineate the roles of Dnmtl and Dnmt3a 
and 3b in maintaining methylation patterns in the mouse genome. Dnmtl alone was able to 
maintain methylation of most CpG-poor regions analyzed. In contrast, both Dnmtl and 
Dnmt3a and/or Dnmt3b were required for the methylation of a select class of sequences 
which included abundant murine LINE-1 promoters. 

20 A novel hemi-methylation assay was used to show that even in wildtype cells these 

sequences contain high levels of hemi-methylated DNA, suggestive of poor maintenance 
methylation. Dnmt3a and/or 3b could restore methylation of these sequences to pretreatment 
levels following transient exposure of cells to 5-Aza-CdR, whereas Dnmtl by itself could 
not. 

25 In conclusion, ongoing de novo methylation by Dnmt3a and/or Dnmt3b compensates 

for inefficient maintenance methylation by Dnmtl of these endogenous repetitive sequences. 
The present results reveal a previously unrecognized degree of cooperativity among 
mammalian DNA methyltransferases in ES cells. 
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We claim: 

1. A method to detennine the fraction of DNA molecules that is hemi-methylated 
at a specific CpG dinucleotide sequence within a palindromic CpG methylation site, in a pool 
of DNA molecules having mixed DNA methylation states at that palindromic CpG site, 
comprising: 

obtaining an isolated genomic DNA sample; 

digesting the DNA sample with an excess of a methylation-sensitive restriction 
endonuclease that cleaves according to a recognition sequence motif comprising a 
pahndromic CpG methylation site, whereby the endonuclease cleaves the DNA at a specific 
recognition motif position if the methylation state of the respective palindromic CpG 
methylation site is un-methylated, but not hemi-methylated or fully-methylated; 

treating the endonuclease-digested DNA with bisulfite, whereby unmethylated 
cytosme residues on both strands of the DNA are converted to uracil; 

amplifying one strand of the bisulfite-treated digested DNA with primers flanking the 
specific recognition motif position comprising a bisulfite-treated specific CpG dinucleotide of 
the pahndromic CpG methylation site, whereby any uracil residues are converted to 
thymidine residues; and 

determining, by sequence analysis of the amplification products or of individual 
clones thereof, the presence of either cytosine or thymidine at the position corresponding to 
the first nucleotide position of the specific CpG dinucleotide, whereby the fraction of DNA 
molecules that is hemi-methylated at the specific CpG dinucleotide is determined by 
averaging results obtained by the sequence analysis. 

2. The method of claim 1, wherein the restriction endonculease is HpaH. 

25 3 * The method of claim L wherein amplifying the bisulfite-treated DNA occurs 

byPCR. " 

4. The method of claim 1, wherein sequence analysis is performed by a 
sequencing method selected from the group consisting of Ms-SNuPE, COBRA, DGE 
MethyLight, MSP, MCA, direct sequencing, hybridization arrays, and combinations thereof. 
30 5 ; ^ method of claim 1, further comprising digestion of the genomic DNA 

sample with a second restriction endonuclease. 

6. The method of claim 5, wherein the second restriction endonuclease is Rsal. 

7. A method to calculate at least one of the rate of de novo methylation (n) or the 
rate of maintenance methylation (/») at a specific CpG dinucleotide sequence within a 

35 palindromic CpG methylation site, comprising: 

determining a value for S, where S is the percent of cytosines at a specific CpG 
dinucleotide found, by the method of claim 1, to be unmethylated by bisulfite analysis of one 
of the two single DNA strands; 



20 



40 



determining a value for P, where P is the percent of cytosines at the specific CpG 
dinucleotide found, by the method of claim 1, to be methylated bybisulfite analysis of one of 
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the two DNA strands; 

deteraiining D, as D is 100-2S; 

determining H, as H is 2SP/(S+D); 

determining F, as F is DP/(S+D); and 
5 determining U, as U is 100-H-F, whereby the rate of de novo methylation (n) at the 

specific CpG dinucleotide sequence is defined as: n = l-(U/(U+H/2)), where n is the fraction 
of unmethylated post-DNA synthesis specific CpG dinucleotide sequences that undergoes 
conversion to fully methylated CpG per cell division, and whereby the rate of maintenance 
methylation (m) at the specific CpG dinucleotide sequence is defined as: m = l-(H/(F+H/2)), 
1 0 where m is the fraction of post-DNA synthesis hemi-methylated specific CpG dinucleotide 
sequences that undergoes conversion to fixlly methylated CpG per cell division. 

8. The method of claim 7, wherein determination of S and P are based on 
analysis of the same DNA strand. 

9. A method to calculate the rate of maintenance methylation (in) at a specific 
15 CpG dinucleotide sequence within a palindromic CpG methylation site, comprising: 

determining a value for S, where S is the percent of cytosines at a specific CpG 
dinucleotide found, by the method of claim 1, to be unmethylated by bisulfite analysis of one 
of the two single DNA strands; 

determining a value for P, where P is the percent of cytosines at the specific CpG 
20 dinucleotide found, by the method of claim 1, to be methylated by bisulfite analysis of one of 
the two DNA strands; 

determining D, as D is 100-2S; 

determining H, as H is 2SP/(S+D); and 

determining F, as F is DP/(S+D), whereby the rate of maintenance methylation (m) at 
25 the specific CpG dinucleotide sequence is defined as: m = l-(H/(F+H/2)), where m is the 
fraction of post-synthesis hemimethylated specific CpG dinucleotide sequences that 
undergoes conversion to fully methylated CpG per cell division. 

10. The method of claim 9, wherein determination of S and P are based on 
analysis of the same DNA strand. 

30 1 1 . A methylation detection kit, in one or more containers, useful for measuring 

the fraction of DNA molecules that is hemi-methylated at a specific CpG dinucleotide in a 
pool of DNA with mixed DNA methylation states at that CpG, comprising: 

a methylation-sensitive restriction endonuclease that cleaves according to a 
recognition sequence motif comprising a palindromic CpG methylation site, whereby the 
35 endonuclease cleaves the DNA at a specific recognition motif position if the methylation 
state of the respective palindromic CpG methylation site is un-methylated, but not hemi- 
methylated or fully-methylated; and 

amplification primers flanking the specific recognition motif position. 

1 2. The kit of claim 1 1 , further comprising bisulfite treatment reagents. 
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Sequence Listing 

<110> University of Southern California; Jones, Peter A. ; 
Liang, Gangning; Tomigahara, Yoshitaka; Laird, Peter W. 

<120> A New Assay for the Detection and Quantitation of 
Hemimethylation 

<130> 47675-16 

<150> US 60/247,191 

<151> 2000-11-08 

<160> 12 

<170> Word 98 

<210> 1 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f top strand primer 
<400> 1 

aatgtgtaat atttttatgg tttttttaga atgg 34 
<210> 2 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f top strand primer 
<400> 2 

t^acaaaaaa aatacctctt ccttactaaa c 31 
<210> 3 
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<211> 26 
<2-12> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f bottom strand primer 
<400> 3 

tggtatttgg aaagattgta ggaaag 26 
<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f bottom strand primer 
<400 4 

caatacctct ataacccttc caaa 24 
<210> 5 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f top strand Ms-SNuPE primer 
<400> 5 

aataattttg tttttttgga tatt 24 
<210> 6 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f top strand Ms-SNuPE primer 
<400> 6 

aaattttgtt tttggttgta aa 22 
<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f bottom strand Ms-SNuPE primer 
<400> 7 

aggaatagaa tttgagatat t 21 
<210> 8 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f bottom strand Ms-SNuPE primer 
<400> 8 

taaattgttt taattagatt aataa 25 
<210> 9 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Cl-f site 1 Ms-SNuPE primer 
<400> 9 

tatggttttt ttagaatgg 19 
<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f site 2 Ms-SNuPE primer 
<400> 10 

aataattttg tttttttgga tatt 24 
<210> 11 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f site 3 Ms-SNuPE primer 
<400> 11 

aaattttgtt tttggttgta aa 22 
<210> 12 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Cl-f site 4 Ms-SNuPE primer 
<400> 12 

ttgttgttat gtgtaattat ttttt 25 
<210> 13 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> AP-PCR primer 
<400> 13 

cacatggttc tgc 13 
<210> 14 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> AP-PCR primer 
<400> 14 

gtctctatga ccc 13 
<210> 15 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> AP-PCR primer 
<400> 15 
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cttactgtgc cac 13 
<210> 16 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cl-f bottom strand primer 
<400> 16 

gagtaaagat agaataaatt gttttaatta g 31 
<210> 17 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d" 5' -primer 
<400> 17 

gtttataggt ttagaggttt t 21 
<210> 18 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d 3' -primer 
<400> 18 

aacacataaa cctattttaa actta 25 
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<210> 19 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "A-Repeats" 5' -primer 
<400> 19 

tgatttattt attagaggtt ttagg 25 
<210> 20 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "A-Repeats" 3 '-primer 
<400> 20 

acataaaaaa acaaactacc c 21 
<210> 21 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CH-d site 1 primer 
<400> 21 

ttttatttat tgttattatg g 21 
<210> 22 
<211> 16 



PCT/US01/47141 



WO 02/38811 



PCT/US01/47141 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d site 2 primer 
<400> 22 

ggtatagttt gagtat 16 
<210> 23 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d site 3 primer 
<400> 23 

tattttttaa tagtattatt ttttat 26 
<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d site 1 sequencing primer 
<400> 24 

ttttatttat tgttattatg g 21 
<210> 25 
<211> 16 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Cll-d site 2 sequencing primer 
<400> 25 

ggtatagttt gagtat 16 
<210> 26 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
' <220> 

<223> Cll-d site 3 sequencing primer 
<400> 26 

ttttttaata aggttatatt tttt 24 
<210> 27 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Cll-d site 4 sequencing primer 
<400> 27 

tattttttaa tagtattatt ttttat 26 
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